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(54) Tille: IMPROVED EXPRESSION OF CRY3B INSECTICIDAL PROTEIN IN PLANTS 
(57) Abstract 

The present invention discloses methods and compositions comprising a group of novel expression cassettes which provide significantly 
improved levels of accumulation of Coleopteran inhibitory Cry3B and Cry3B variant amino acid sequences when these are expressed in 
plants. The preferred embodiments of the invention provide at least up to ten fold higher levels of insect controlling protein relative to 
the highest levels obtained using prior compositions. In particular, transgenic maize expressing higher levels of a protein designed to 
exhibit increased toxicity toward Coleopteran pests deliver superior levels of insect protection and are less likely to sponsor development 
of populations of target insects that are resistant to the insecticidally active protein. 
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Improved Expression of Crv3B Insecticidal Protein in Plants 

1.0 Background of the Invention 

1.1 Field of the Invention 

The present invention discloses transgenic plants expressing substantially higher 
levels of insect controlling Bacillus thuringiensis 8-endotoxin. Methods for obtaining such 
plants and compositions, and methods for using such plants and compositions are described. 
Also disclosed are improved polynucleotide cassettes containing preferred protein coding 
sequences which impart the substantially higher levels of insect controlling 8-endotoxins. The 
preferred embodiments of the invention surprisingly provide up to ten fold higher levels of insect 
controlling protein relative to the highest levels obtained using prior compositions. In particular, 
transgenic maize expressing higher levels of a protein designed to exhibit increased toxicity 
toward Coleopteran pests deliver superior levels of insect protection and are less likely to 
sponsor development of populations of target insects that are resistant to the insecticidally active 
protein. 

1.2 Description of the Related Art 

Almost all field crops, plants, and commercial fanning areas are susceptible to attack 
by one or more insect pests. Particularly problematic are Coleopteran and Lepidopteran pests. 
Because crops of commercial interest are often the target of insect attack, environmentally- 
sensitive methods for controlling or eradicating insect infestation are desirable. This is 
particularly true for farmers, nurserymen, growers, and commercial and residential areas which 
seek to control insect populations using ecologically friendly compositions. 

The most widely used environmentally-sensitive insecticidal formulations developed 
in recent years have been composed of microbial protein pesticides derived from the bacterium 
Bacillus thuringiensis, a Gram-positive bacterium that produces crystal proteins or inclusion 
bodies which are specifically toxic to certain orders and species of insects. Many different 
strains of B. thuringiensis have been identified which produce one or more insecticidal crystal 
proteins as well as other insecticidal non-crystal forming proteins. Compositions including B. 
thuringiensis strains which produce insecticidal proteins have been commercially available and 
used as environmentally acceptable insecticides because they are quite toxic to specific target 
insect pests, but are harmless to plants and to vertebrate and invertebrate animals. More 
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importantly, because these insect controlling proteins have to be ingested by susceptible target 
insect pests in order to exert their insecticidal or toxic effects, judicious application of such 
protein compositions limits or prevents non-target insect members of the susceptible order which 
may also be susceptible to the composition from significant exposure to the proteins (for 
example, non-target Lepidopteran species where Lepidopteran specific B.t. crystal protein is 
used in an insecticidal formulation). Additionally, insects of various orders have been shown to 
totally lack susceptibility to specifically targeted insecticidal proteins even when ingested in 
large amounts. 

1.2.1 8-Endotoxins 

5-endotoxins are used to control a wide range of plant-eating caterpillars and beetles, 
as well as mosquitoes. These proteins, also referred to as insecticidal crystal proteins, crystal 
proteins, and Bt toxins, represent a large collection of insecticidal proteins produced by B. 
thuringiensis that are toxic upon ingestion by a susceptible insect host. Over the past decade 
research on the structure and function of B. thuringiensis toxins has covered all of the major 
toxin categories, and while these toxins differ in specific structure and function, general 
similarities in the structure and function are assumed. A recent review describes the genetics, 
biochemistry, and molecular biology of Bt toxins (Schnepf et al., Bacillus thuringiensis and its 
Pesticidal Crystal Proteins, Microbiol. Mol. Biol. Rev. 62:775-806, 1998). Based on the 
accumulated knowledge of B. thuringiensis toxins, a generalized mode of action for B. 
thuringiensis toxins has been created and includes: ingestion by the insect, solubilization in the 
insect midgut (a combination stomach and small intestine), resistance to digestive enzymes 
sometimes with partial digestion by gut specific proteases catalyzing specifically a cleavage at a 
peptide site within a protoxin structure which "activates" the toxin, binding of the toxin to the 
midgut cells' brush border, formation of a pore in the insect midgut cell, and the disruption of 
cellular homeostasis (English and Slatin, 1992). 

1.2.2 Genes Encoding Crystal Proteins 

Many of the 8-endotoxins are related to various degrees by similarities in their amino 
acid sequences. Historically, the proteins and the genes which encode them were classified 
based largely upon their spectrum of insecticidal activity. A review by Hofte and Whiteley 
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(1989) discusses the genes and proteins that were identified in B. thuringiensis prior to 1990, and 
sets forth the nomenclature and classification scheme which has traditionally been applied to B. 
thuringiensis genes and proteins. The original nomenclature took advantage of the discovery 
that the few Bt Cry proteins known at the time generally fell into a limited number of classes, 
wherein each class represented proteins having specificity for specific orders of insects. For 
example, cryl genes encoded Lepidopteran-toxic Cryl proteins, cryl genes encoded Cry2 
proteins that were generally toxic to both Lepidopterans as well as to Dipterans. cry3 genes 
encoded Coleopteran-toxic Cry3 proteins, while cryA genes encoded Dipteran-specific toxic 
Cry4 proteins. The nomenclature has, for the past decade or more become rather confusing with 
the discovery of more distantly related classes of insecticidal Bt proteins. More recently, a 
simplified homogeneous nomenclature and basis for classifications of Bt proteins has been 
adopted and has been reviewed by Schnepf et al. (1998). Schnepf et al. (1998) also provides a 
structural solution for a Cryl crystal. This simplified nomenclature will be adopted herein. The 
convention of identifying Bt genes with lower case, italicized letters (eg. crylAbl) and 
identifying Bt proteins with uppercase first character (eg. CrylAbl) will also be observed herein 

Based on the degree of sequence similarity, the proteins have been further classified 
into subfamilies. Proteins which appeared to be more closely related within each family were 
assigned divisional letters such as Cryl A, Cry IB, CrylC, etc. Even more closely related 
proteins within each division were given names such as CrylCa, CrylCb, etc. and still even 
more closely related proteins within each division were designated with names such as CrylBbl, 
CrylBb2, etc. 

The modern nomenclature systematically classifies the Cry proteins based upon 
amino acid sequence homology rather than upon insect target specificities. The classification 
scheme for many known toxins, not including allelic variations in individual proteins, is 
summarized in regularly updated tables which can be obtained from Dr. Neil Crickmore at 
http://epunix.biols.susx.ac.uk/Home/Neil_Crickmore/Bt/index.html. 

1.2.3 BlO-INSECTIClDE POLYPEPTIDE COMPOSITIONS 

The utility of bacterial crystal proteins as insecticides was extended beyond 
Lepidopterans and Dipteran larvae when the first isolation of a Coleopteran-toxic B. 
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thuringiensis strain was reported (Krieg et al, 1983; 1984). This strain (described in U.S. Patent 
4,766,203, specifically incorporated herein by reference), designated B. thuringiensis var. 
tenebrionis, was reported to be toxic to larvae of the Coleopteran insects Agelastica alni (blue 
alder leaf beetle) and Leptinotarsa decemlineata (Colorado potato beetle). 

U. S. Patent 5,024, 837 also describes hybrid B. thuringiensis var. kurstaki strains 
which showed activity against Lepidopteran insects. U. S. Patent 4,797,279 (corresponding to 
EP 0221024) discloses a hybrid B. thuringiensis containing a plasmid from B. thuringiensis var. 
kurstaki encoding a Lepidopteran-toxic crystal protein-encoding gene and a plasmid from B. 
thuringiensis tenebrionis encoding a Coleopteran-toxic crystal protein-encoding gene. The 
hybrid B. thuringiensis strain produces crystal proteins characteristic of those made by both B. 
thuringiensis kurstaki and B. thuringiensis tenebrionis. U. S. Patent 4,910,016 (corresponding to 
EP 0303379) discloses a B. thuringiensis isolate identified as B. thuringiensis MT 104 which has 
insecticidal activity against Coleopterans and Lepidopterans. More recently, Osman et al. 
disclosed a natural Bacillus thuringiensis isolate which displayed activity against at least two 
orders of insects and against nematodes (WO 98/30700). 

It has been known for more than two decades that compositions comprising Bt 
insecticidal proteins are effective in providing protection from insect infestation to plants treated 
with such compositions. More recently, molecular genetic techniques have enabled the 
expression of Bt insecticidal proteins from nucleotide sequences stably inserted into plant 
genomes (Perlak et al., Brown & Santino, etc.). However, expression of transgenes in plants has 
provided an avenue for increased insect resistance to Bt's produced in plants because plants have 
not been shown to produce high levels of insecticidal proteins. It was initially believed that 
gross morphological or topological differences in gene structure and architecture between plant 
and bacterial systems was the limitation which prevented over-expression of Bt transgenes in 
plants. These differences were seemingly overcome as disclosed by Perlak et al. (US Patent No. 
5,500,365) and by Brown et al. (US Patent No.'s 5,424,412 and 5,689,052) wherein transgenes 
encoding Bt insecticidal protein which contained plant preferred codons were shown to improve 
the levels of expression. Alternatively, truncating the protoxin coding domain to the shortest 
peptide coding domain which still encoded an insecticidal protein was also deemed sufficient to 
overcome the limitation of vanishingly low expression levels of the Bt encoding transgene in 
planta. Expression levels of Bt proteins in planta from transgenes has varied widely independent 
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of the means used for expression, and accumulated protein levels have ranged from virtually 
undetectable to 2 parts per million to around 20 to 30 parts per million. However, even though 
all of these approaches provided improved levels of Bt protein accumulation in plants, none 
provided levels of expression which could ensure that insect resistance would not become a 
problem without the necessity of coordinate expression of one or more additional insecticidal 
toxins by the transgenic plant, or alternatively without the coordinate topical application of 
additional supplemental Bt or insecticidal chemical compositions. 

The importance of accumulation of higher levels of Bt toxin for preventing insect 
resistance to individual Bt toxins has been understood for some time. Various laboratory studies 
in which selection against Bt was applied over several generations of insects have confirmed that 
resistance against Bt insecticidal proteins is seldom obtained. It should be emphasized that 
laboratory conditions represent rather low but constant selection pressure conditions, allowing 
for the survival of a sub-population of insects which have been subjected to insecticidal pressure 
and which produce the subsequent generations of insects. Succeeding generations are also 
maintained on media containing low but constant concentrations of insecticidal protein. 
Generally, concentrations used for selection pressures range from LC40 to around LC60 or so, 
however, LC95 concentrations have also tested for the development of resistance. In most cases, 
resistance is acquired slowly, generally developing within a reasonably few generations, for 
example 10-50 generations. However, such resistance is not observed where substantially higher 
levels of toxin are used, or in situations in which multiple toxins are provided. 

At present, recombinant plants expressing commercially useful levels of Bt 
insecticidal protein generally contain only one gene encoding a single class of Bt. Such plants 
are anticipated to have a very limited duration of use for two reasons. First, these plants are 
expressing insufficient levels of the insecticidal protein to ensure that all target insects exposed 
to and feeding from the plant tissues will succumb due to the dose of toxin ingested. Second, 
because of the insufficient insecticidal protein levels, the potential for development of resistance 
is unreasonably increased. This is not to say that the level of toxin produced by such transgenic 
plants is insufficient to be effective. This merely represents the limitations of expression of 8- 
endotoxins in planta even when using sequences encoding Bt 5-endotoxin which have been 
modified to conform to plant preferred sequences. One limitation which has been observed for 
many Bt 5-endotoxin encoding sequences modified for expression in plants is that is has been 
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impossible to predict which Bt 8-endotoxin would be effective for expression in plants. ( For 
example, expression of Cry2Aa in cotton plants results in phytotoxicity when targeted to the 
chloroplast, however expression of a closely related cry2Ab sequence is not phytotoxic when 
targeted to the chloroplast. (Corbin et al., US Patent Application, Serial No. 09/1 86,002 ). Even 
so, levels of 8-endotoxin protein produced in plants is not sufficient to be effective against all 
desired target insect species known to be susceptible to a given type and class of 8-endotoxin. 

As indicated above, alternative approaches to development of resistance to 
insecticidal proteins has included ineffective attempts to increase the expression levels of 
transgenes in plants. Alternatively, additional insecticidal genes could be engineered into plants 
so that multiple toxins are coordinately expressed. This would provide a more effective means 
for delaying the onset of resistance to any one combination of Bt's, however, this still does not 
overcome the limitation of insufficient levels of insecticidal protein accumulating in the 
recombinant plant(s). An additional alternative to insufficient levels of expression has been to 
engineer genes encoding Bt insecticidal crystal proteins which demonstrate improved insecticidal 
properties, having either a broader host range or an increased biological activity, which could 
conceivably result in requiring less of the recombinant protein to control a target insect species 
than was required of the native form of the protein. 

The combination of structural analyses of B. thuringiensis toxins followed by an 
investigation of the function of such structures, motifs, and the like has taught that specific 
regions of crystal protein endotoxins are, in a general way, responsible for particular functions. 

Domain 1, for example, from Cry3Bb and Cry 1 Ac has been found to be responsible 
for ion channel activity, the initial step in formation of a pore (Walters et al, 1993; Von Tersch 
et al, 1994). Domains 2 and 3 have been found to be responsible for receptor binding and 
insecticidal specificity (Aronson et al, 1995; Caramori et al, 1991; Chen et al 1993; de Maagd 
et al, 1996; Ge et al, 1991; Lee et al, 1992; Lee et al, 1995; Lu et al, 1994; Smedley and 
Ellar, 1996; Smith and Ellar, 1994; Rajamohan et al, 1995; Rajamohan et al, 1996; Wu and 
Dean, 1996). Regions in domain 2 and 3 can also impact the ion channel activity of some toxins 
(Chen et al, 1993, Wolfersberger et al, 1996; Von Tersch et al, 1994). 

Unfortunately, while many investigators have attempted, few have succeeded in 
making mutated crystal proteins with improved insecticidal toxicity. In almost all of the 
examples of genetically-engineered B. thuringiensis toxins in the literature, the biological 
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activity of the mutated crystal protein is no better than that of the wild-type protein, and in many 
cases, the activity is decreased or destroyed altogether (Almond and Dean, 1993; Aronson et al, 
1995; Chen et al, 1993, Chen et al, 1995; Ge et al, 1991; Kwak et al, 1995; Lu et al, 1994; 
Rajamohan et al, 1995; Rajamohan et al, 1996; Smedley and Ellar, 1996; Smith and Ellar, 
1994; Wolfersberger et al, 1996; Wu and Aronson, 1992). However, Van Rie et al. have 
recently accomplished the improvement of a Cry3A 5-endotoxin having increased Coleopteran 
insecticidal activity by identifying a single mutant having increased insecticidal activity. Van 
Rie et al. propose a method for identifying mutants having increased insecticidal activity in 
which the method consists of identifying amino acid mutations which decrease the insecticidal 
activity, and selectively altering those residues by site directed mutagenesis to incorporate one or 
more of the naturally occurring 20 amino acids at those positions, and feeding the various forms 
of the resulting altered protein to western or northern corn rootworms to identify those having 
improved activity (US Patent 5,659,123). While no sequences were enabled using the method, 
as mentioned above, Van Rie et al. succeeded in identifying only one sequence having increased 
activity and did not demonstrate an increase in expression of the mutant form as compared to the 
native sequence. 

For a crystal protein having approximately 650 amino acids in the sequence of its 
active toxin, and the possibility of 20 different amino acids at each position in this sequence, the 
likelihood of arbitrarily creating a successful new structure is remote, even if a general function 
to a stretch of 250-300 amino acids can be assigned. Indeed, the above prior art with respect to 
crystal protein gene mutagenesis has been concerned primarily with studying the structure and 
function of the crystal proteins, using mutagenesis to perturb some step in the mode of action, 
rather than with engineering improved toxins. 

Collectively, the limited successes in the art to develop non-naturally occurring toxins 
with improved insecticidal activity have stifled progress in this area and confounded the search 
for improved endotoxins or crystal proteins. Rather than following simple and predictable rules, 
the successful engineering of an improved crystal protein may involve different strategies, 
depending on the crystal protein being improved and the insect pests being targeted. Thus, the 
process is highly empirical. 

Accordingly, traditional recombinant DNA technology is clearly not routine experimentation for 
providing improved insecticidal crystal proteins. What has been lacking in the prior art are rational methods for 
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producing genetically-engineered B. thuringiensis crystal proteins that have improved insecticidal activity and, in 
particular, improved toxicity towards a wide range of Lepidopteran, Coleopteran, or Dipteran insect pests. Methods 
and compositions which address these concerns were disclosed in US Patent Application No. 08/993,170 (December 
18, 1997; English et al.) and other related US applications (08/993,722, Dec. 18, 1997, English et al.; 08/993,755, 
Dec. 18, 1997, English et al.; and 08/996,441, Dec. 18, 1997, English et al.) and in Van Rie et al. (US Patent 
5,659,123, Jun 1, 1999). In addition, recombinant^ improved 5-endotoxins have continued to be expressed poorly 
and/or cause phytoxic effects when expressed in plants, thus leading to the recovery of fewer commercially useful 
transgenic events. 

2.0 Summary of the Invention 

Described herein are novel compositions and methods for expressing in transformed 
plants variant Cry3 B. thuringiensis S-endotoxins having significant Coleopteran inhibitory 
activity. These compositions and methods advantageously result in plants expressing B. 
thuringiensis Cry3 8-endotoxins at increased levels not previously observed for Cry 8- 
endotoxins. Increased levels of Cry3 8-endotoxin expression are reflected in the attainment of 
higher maximal expression levels in individual transgenic insertion events. Unexpectedly, the 
particular compositions disclosed herein result in the recovery of an increased percentage of 
transgenic events which manifest expression levels that far exceed threshold levels of expression 
necessary for Coleopteran insect control and which provide sufficient toxin levels capable of 
supporting a resistance management strategy. Since Cry3 8-endotoxins are typically less potent 
than other 8-endotoxins commonly used to control Lepidopteran or Dipteran target pests when 
expressed in transgenic plants, attainment of higher maximal levels of Cry3 8-endotoxin 
expression and recovery of more transgenic events with effective expression levels are both 
critical in isolating transgenic events expressing Cry3 8-endotoxin which exhibit commercially 
useful levels of target insect control. 

Another limitation of the prior art addressed by the present invention is the development 
of insect resistance to 8-endotoxins provided by plant expression. Specifically, the instant 
invention provides a superior strategy for the delay or elimination of the development of 
resistance to Cry3 8-endotoxins through improved accumulation of 8-endotoxin within plant 
cells so that levels of the 8-endotoxin are maintained in-planta above a threshold level of protein, 
typically measured in parts per million (ppm). Improved expression of 8-endotoxins, which also 
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should be taken to mean increased expression in view of what has been previously observed in 
the art, is believed to result in delayed onset of insect resistance and thus extends the utility of 
plant expressed 5-endotoxins as insect control agents. 

In preferred embodiments, the present invention provides isolated and purified novel 
Cry3B 6-endotoxin proteins exhibiting particularly effective insecticidal activity directed toward 
controlling Coleopteran pest insect species. Such 8-endotoxin proteins of the present invention 
are provided by expression from isolated, purified and improved or enhanced DNA or 
polynucleotide sequences each comprising a Cry3 8-endotoxin coding sequence placed under the 
control of preferred plant functional gene expression elements such as a promoter, an 
untranslated leader sequence, an intron and a transcription termination and polyadenylation 
sequence. Some preferred DNA or polynucleotide sequences may also provide for plastid or 
chloroplast targeting protein sequences. Preferred DNA constructs of the present invention 
include those constructs which encode Cry3 5-endotoxins exhibiting Coleopteran-inhibitory or 
Coleopteran-controlling activity. In an illustrative embodiment, polynucleotide sequences are 
assembled into an expression cassette for introduction into plant genomic DNA, wherein the 
expression cassette comprises a Cry3Bb 8-endotoxin variant coding sequence operably linked to 
a sequence comprising a promoter, an untranslated leader sequence, an intron and a transcription 
termination and polyadenylation sequence. In particular, a transgene localized within a plant 
operable polynucleotide expression cassette or polynucleotide sequence comprising an 
expression cassette which is comprised of genetic elements which function in plant cells to 
express a desired protein from a nucleic acid coding sequence (the transgene) which is operably 
localized within said expression cassette. The coding sequence is linked upstream to at least a 
promoter sequence, an untranslated leader sequence (UTL), an intron sequence, and in-frame in 
certain indicated embodiments to a sequence encoding a plastid or chloroplast targeting peptide. 
The coding sequence is also linked downstream to at least a plant functional transcription 
termination and polyadenylation sequence. Polynucleotide sequences comprising such an 
expression cassette are shown herein to improve expression of the desired protein encoded from 
within the cassette, improve the number of events obtained from the use of the polynucleotide 
sequence in plant transformation, wherein said improved number of events contain the desired 
transgene localized within the expression cassette and exhibit improved levels of expression of 
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one or more desired proteins. The improved number of events are also surprisingly observed to 
express the desired protein at levels above 2 to 5 parts per million but in general below 200 to 
500 parts per million of total cell protein. Even more surprising were some events in particular 
which expressed the desired protein at levels well above 500 ppm. Indicated embodiments 
disclose a sequence encoding a variant Cry3Bb 8-endotoxin comprising the isolated and purified 
SEQ ID NO:9, from Ncol to EcoKl as set forth in Figure 1 illustrating plasmid pMON25096. 
Yet other embodiments disclose a variant Cry3Bb 5-endotoxin coding sequence comprising an 
isolated and purified SEQ ID NO:l 1, from Ncol to EcoRl as set forth in Figure 2 illustrating 
plasmid pMON3374l. It is contemplated, however, that any Cry3 8-endotoxin exhibiting 
substantial Coleopteran-inhibitory or Coleopteran-controlling activity greater than or equal to 
that disclosed in the present invention could be utilized according to the embodiments of the 
present invention, with those Cry3 proteins bearing substantial homologies to Cry3Bb being 
particularly preferred. 

In a preferred embodiment, the invention provides for transgenic plants which have been 
transformed with a DNA construct or expression cassette of the present invention that is 
expressed and translated at unexpectedly high levels by the plant which results in surprisingly 
high levels of 8-endotoxin accumulation. Monocotyledenous plants may be transformed 
according to the methods and with the DNA constructs disclosed herein. However, it is also 
anticipated that dicotyledenous plants could also be transformed with DNA sequences disclosed 
herein by one skilled in the art in order to obtain transgenic plants providing unexpectedly useful 
levels of insect resistance without the risk of development of insect resistance to the 8-endotoxin. 
The plant transformed by the instant invention may be prepared, in a further preferred 
embodiment, by a process including obtainment of the isolated and purified DNA construct 
contained within the expression cassette, and then transforming the plant with the construct so 
that the plant expresses the protein for which the construct encodes. Alternatively, the plant 
transformed by the instant invention may be prepared, in a further preferred embodiment, by a 
process including introduction of the isolated and purified DNA construct into a transformation 
competent Agrobacterium strain, and then transforming the plant with the Agrohacterium strain 
containing the construct so that the plant expresses the proteins for which the construct encodes. 
It has been observed herein that transformation of plants by the disclosed compositions and 
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methods results surprisingly in increased frequencies of transformants exhibiting transgene 
expression as well as in the recovery of individual transgenic events exhibiting unexpectedly 
higher absolute levels of transgene expression. 

It is contemplated that the increased expression levels observed in the disclosed invention will 
allow for reduced development of insect resistance to Bt 8-endotoxins presented to target insect 
pests. This may be achieved by transforming a plant with the preferred DNA construct to 
achieve high rates of Cry3 expression alone, or by simultaneously exposing target insects to the 
disclosed Cry3 8-endotoxins along with other compositions effective in controlling Coleopteran 
species such as variants of Cry3B (English et al., WO 99/31248), variant Cry3A or variant 
Cry3D (US Patent 5,659,123), CryET33 and CryET34 (Donovan et al., WO 97/17600), 
CryET70 (US Application Serial No. 09/184,748; Mettus et al., November 2, 1998), Cry6A, 
Cry6B, Cry8B (US Pat. No. 5,277,905), CryET29 (Rupar et al., WO 97/21 587), insecticidal acyl 
lipid hydrolases, combinations of amino acid oxidases and tedanalactam synthases (Romano et 
al., US Application Serial No. 09/063,733, filed April 21, 1998), or insecticidal proteins such as 
VIP1 (Gay, WO 97/26339; Gourlet et al, WO 98/02453) and VIP3 (Estruch et al., US Pat. No. 
5,877,012; 1999) among others. Susceptible target insects include Diabroticus spp. Wire Worm 
in Zea mays and Leptinotarsa decemlineata (Say) in Solarium tuberosum, and Boll Weevil in 
Gossypium species (cotton). 

It is therefore contemplated that the compositions and methods disclosed by the present 
invention will provide many advantages over the prior art including those specifically outlined 
above. Other advantages include improved control of susceptible target insect pests and 
achieving season long protection from insect pathogens. An additional advantage of the present 
invention provides for reducing the number of transgenic events that have to be screened in order 
to identify one which contains beneficial levels of one or more insect controlling compositions. 
The present invention also encompasses cells transformed with the DNA constructs disclosed 
herein. Also, transformation vectors such as plasmids, bacmids, artificial chromosomes, viral 
vectors and such are contemplated as elements for use in delivering the nucleotide compositions 
of the present invention into contemplated cells in order to obtain transformed host cells, both 
prokaryotic and eukaryotic, which express the 5-endotoxin proteins encoded by the novel DNA 
construct disclosed herein. It is further contemplated that in some instances the genome of a 
transgenic plant of the present invention will have been augmented through the stable integration 
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of an expression cassette encoding a Coleopteran inhibitory or controlling B. thuringiensis 6- 
endotoxin or variants thereof as described herein. Furthermore, more than one transgene 
encoding an insecticidal composition will be incorporated into the nuclear genome, or 
alternatively, into the chloroplast or plastid genome of the transformed host plant cell. It is 
envisioned that more than one polynucleotide encoding an insecticidal crystal protein will be 
incorporated into the genome of a plant cell and it may be desirable to have two or even more 
sequences encoding insecticidal or other plant beneficial proteins within the nucleotide 
sequences contained within the cell. Such recombinantly derived proteins may exist as 
precursors, pro-toxins, or as fusions of beneficial proteins linked by flexible amino acid linker 
sequences or by protease specific cleavage sequences well known in the art. Chimeras 
comprising fusions of insecticidal proteins are also envisioned. The offspring of transgenic plant 
host cells can be manipulated artificially to produce whole recombinant plants exhibiting 
improved insecticidal properties, and the recombinant nucleotide sequences are shown herein to 
be heritable. The heritability of the elements is a preferred aspect of this invention, so that the 
expression elements are able to be delivered to lineal descendants of the original transformed 
host plant cell, giving rise first to a stably transformed plant whose constituent cells express the 
desired transgene, albeit tissue specific expression can be selectively manipulated generally 
through the choice of plant operable promoter selected for use in a given expression cassette, as 
described above. Transformed plants give rise to seeds containing the heritable expression 
cassette, and the seeds thus give rise to plants in lineal fashion which contain the expression 
cassette, generally in Mendelian fashion, particularly when selfed according to well known 
methods in the art. 

3.0 Brief description of the drawings 

Figure 1 illustrates plasmid pMON25096. 
Figure 2 illustrates plasmid pMON33741. 
Figure 3 illustrates plasmid pMON25097. 
Figure 4 illustrates plasmid pMON33748. 

Figure 5 illustrates the nucleotide and amino acid sequence translation of a variant 
Cry3Bb.l 1098 insecticidal protein as shown in SEQID NO:9. 
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Figure 6 illustrates the nucleotide and amino acid sequence translation of a variant 
Cry3Bb. 11231 insecticidal protein as shown in SEQID NO: 1 1 . 

4.0 Detailed description of the invention 

The following detailed description of the invention is provided to aid those skilled 
in the art in practicing the present invention. Even so, the following detailed description should 
not be construed to unduly limit the present invention as modifications and variations in the 
embodiments discussed herein may be made by those of ordinary skill in the art without 
departing from the spirit and scope of the present invention. 

4.1 Definitions 

The following words and phrases have the meanings set forth below. 

Biological functional equivalents. As used herein such equivalents with respect to the 
insecticidal proteins of the present invention are peptides, polypeptides and proteins that contain 
a sequence or moiety exhibiting sequence similarity to the novel peptides of the present 
invention, such as Cry3Bb.l 123 1, and which exhibit the same or similar functional properties as 
that of the polypeptides disclosed herein, including insecticidal activity. Biological equivalents 
also include peptides, polypeptides and proteins that react with, i.e. specifically bind to 
antibodies raised against Cry3Bb and that exhibit the same or similar insecticidal activity, 
including both monoclonal and polyclonal antibodies. 

Combating or Controlling Insect Damage in an agricultural context refers to reduction 
of damage in relative units to a crop or plant part caused by infestation of an insect pest. More 
generally, this phrase refers to reduction in the adverse effects caused by the presence of an 
undesired insect in any particular location. 

Event refers to a transgenic plant derived from one of the following: 

1 . the insertion of foreign DNA into one or more unique sites in the nuclear genomic 
DNA; 

2. the insertion of foreign DNA into one or more unique sites in the plastid, chloroplast 
or mitochondrial genome; 
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3. the introduction of a stable, heritable, epigenetic vector into the cytoplasm of a 
plastid, chloroplast, or mitochondria; or 

4. a combination of any of the foregoing processes. 

Events derived from these processes contain an expression cassette expressing a desired 
coding sequence as described herein. Events are also referred to as ITE's (independent 
transformation events). 

Expression: The combination of intracellular processes, including transcription, 
translation, and other intracellular protein and RNA processing and stabilization functions, 
undergone by a nucleic acid coding sequence controlled by genetic sequences which function in 
plant cells to achieve production of a desired product, such as a structural gene encoding an RNA 
molecule, or an RNA molecule being used as a substrate for a reverse transcriptase enzyme or 
enzyme complex. 

Improved or enhanced expression cassette refers to the specific combination and order 
of genetic elements associated with the insecticidal protein encoding sequence which, when 
expressed within a plant cell: 

gives rise to the surprising average level of that protein expressed in plants, plant 
tissue, or plant cells; 

gives rise to the unexpected number of transformation events expressing a 
surprisingly higher average level of insecticidal protein; 

gives rise to individual plants, plant tissue, or plant cells expressing an 
unexpectedly high level of the insecticidal protein; and 

gives rise to plants expressing unexpected levels of insecticidal protein effective 
in controlling or combating Coleopteran pests and preventing development of resistance 
by the Coleopteran pest to the particular insecticidal protein. 

Insecticidal polypeptide refers to a polypeptide having insecticidal properties, e.g., a. 
polypeptide which exhibits the properties of inhibiting the growth, development, viability or 
fecundity of target insect pests. 

Operably Linked: Nucleic acid or polynucleotide sequences connected sequentially in 
linear form, so that the properties of one influence the expression characteristics of the other. A 
promoter, for example, operably linked to other polynucleotide sequences (which may consist of 
operator or enhancer sequences, untranslated or translated leader sequences, intron sequences, 
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structural gene coding sequences, non-structural genes, transcription and translation termination 
sequences, and polyadenylation sequences) influences the expression of a coding or noncoding 
sequence, whether the product is RNA, protein, or other product. Similarly, an intron or an 
untranslated leader sequence can influence the expression and stability of sequences operably 
linked to them, and structural or non-structural gene sequences can be influenced by elements 
operably linked upstream, within, or downstream. 

Plant-Expressible Coding Regions: Amino acid coding regions or open reading frames 
(ORFs) which are expressible in planta because they contain typical plant regulatory elements 
facilitating their expression, and often include changes to the coding sequence such that plant 
preferred codons are utilized in place of non-preferred codons where heterologous coding 
regions are contemplated. 

Plastid Transit Peptide: Any amino acid sequence useful in targeting a linked amino 
acid, such as a protein fusion, to a subcellular compartment or organelle such as a plastid or 
chloroplast. 

Polynucleotide sequence: Any DNA or RNA sequence of four or more consecutive 
nucleotides or ribonucleotides. Generally polynucleotide sequences as disclosed herein comprise 
at least 50 or more nucleotides or ribonucleotides. 

Progeny: "Progeny" includes any offspring or descendant of the transgenic plant, or any 
subsequent plant which contains the transgene(s) in operable form. Progeny is not limited to one 
generation, but rather encompasses the transformant's descendants so long as they contain or 
express the transgene(s). Seeds containing transgenic embryos as well as seeds from the 
transgenic plants and their offspring or descendants which, after Mendelian segregation continue 
to contain the transgene(s), are also important parts of the invention. 

Promoter: A recognition site on a DNA sequence or group of DNA sequences that 
provide an expression control element for a preferred polynucleotide sequence and to which 
RNA polymerase specifically binds and initiates RNA synthesis (transcription) of that preferred 
sequence. 

Ro is the primary regenerant plant derived from transformation of plant tissue or cells in 
culture. Subsequent progeny or generations derived from the R 0 are referred to as Ri (first 
generation), R2 (second generation), etc. 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 



PCT/US99/18883 



-16- 

Regeneration: The process of growing a plant from a plant cell or group of plant cells 
(e.g., plant protoplast, embryo, callus, or explant). 

Structural Coding Sequence refers to a DNA sequence that encodes a peptide, 
polypeptide, or protein that is made by a cell following transcription of the structural coding 
sequence to messenger RNA (mRNA), followed by translation of the mRNA to the desired 
peptide, polypeptide, or protein product. 

Structural gene: A gene or polynucleotide sequence containing the coding sequence of 
a desired polypeptide that is expressed by transcription and translation to produce the desired 
polypeptide. 

Synthetic gene: Synthetic genes encoding the B. thuringiensis 5-endotoxins of the 
present invention are those prepared in a manner involving any sort of genetic isolation or 
manipulation which alters the naturally occurring coding sequence of the S-endotoxin gene. This 
includes isolation of the gene from its naturally occurring state, manipulation of the gene as by 
codon modification (as described herein), or site-specific mutagenesis (as described herein), 
truncation of the gene or any other manipulative or isolative method. A synthetic gene can also 
be a polynucleotide sequence which is not known to be naturally occurring but which encodes a 
useful polypeptide or other product such as a tRNA or an antisense polynucleotide. A non- 
naturally occurring polynucleotide sequence. 

Substantial homology: As this term is used herein, it refers to nucleic acid or 
polypeptide sequences which are about 86% homologous, to about 90% homologous, to about 
95% homologous, to about 99% homologous. More specifically, the inventors envision 
substantial homologues to be about 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, and 99 
percent homologous to the referent nucleic acid sequence of polypeptide. 

Terminator: With reference to eukaryotic nuclear gene expression processes, the 
operable 3' end transcription termination and polyadenylation sequence. With reference to 
prokaryotic gene expression, and including plastid or chloroplast gene expression, the operable 
DNA sequence at the 3' end of an open reading frame which, for ORF's expressing protein 
product, at least one termination codon in frame with the coding sequence of the ORF, which 
may also be followed by a DNA sequence encoding a transcription termination signal which may 
cause the translated RNA or mRNA product to form a hairpin or other three dimensional 
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structure which may or may not act together with one or more soluble structural proteins to cause 
transcription to be interrupted. 

Transformation: A process of introducing an exogenous polynucleotide sequence (e.g., 
a vector, or a recombinant or non-recombinant DNA or RNA molecule) into a cell or protoplast 
in which that exogenous polynucleotide is incorporated into a heritable genetic element or is 
capable of autonomous replication and thus stably maintained within that cell or protoplast as 
well as in the progeny of that cell or protoplast. 

Transformed cell: A cell which contains a heritable genetic element altered by the 
introduction of one or more exogenous DNA molecules. A transgenic cell. Exemplary 
transformed or transgenic cells include plant calli derived from a transformed plant cell and 
particular cells such as leaf, root, stem, e.g., somatic cells, or reproductive (germ) cells obtained 
from a transgenic plant. 

Transgene: A gene construct, expression cassette, or DNA segment or sequence 
comprising an ORF which is desired to be expressed in the recipient cell, tissue or organism. 
This may include an entire plasmid, or other vector, or may simply include the functional coding 
sequence, region, domain, or segment of the transferred DNA sequence. 

Transgenic event: A plant or progeny thereof derived from a plant cell or protoplast 
manufactured or constructed to contain one or more exogenous DNA molecules inserted into the 
nuclear or other genome of the plant cell, or introduced and stably maintained within the 
cytoplasm of a plastid, chloroplast, or mitochondria, which confers some physically detectable 
phenotype upon the plant or progeny thereof. 

Transgenic plant: A plant or progeny thereof which has been genetically modified to 
contain and express heterologous DNA sequences either as proteins or as nucleic acids. As 
specifically exemplified herein, a transgenic corn plant is genetically modified to contain and 
express at least one heterologous DNA sequence operably linked to and under the regulatory 
control of transcriptional control sequences which function together in plant cells or tissue or in 
whole plants to achieve expression from a nucleic acid sequence encoding an insecticidal 8- 
endotoxin protein or an amino acid sequence variant thereof. A transgenic plant may also be 
referred to as a transformed plant. A transgenic plant also refers to progeny of the initial 
transgenic plant where those progeny contain and express the heterologous coding sequence 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 



PCT/US99/18883 



-18- 

under the regulatory control of the plant-expressible transcription control sequences described 
herein. 

Vector: A polynucleotide capable of replication in a host cell and/or to which another 
polynucleotide sequence can be operatively linked so as to bring about replication of the linked 
sequence. A plasmid is an exemplary vector. 

The present invention discloses novel DNA constructs comprising polynucleotide 
sequences encoding B. thuringiensis S-endotoxins. Methods for the construction and expression 
of synthetic B. thuringiensis genes in plants are well known by those of skill in the art and are 
described in detail in U. S. Patent 5,500,365. The present invention contemplates the use of 
Cry3B B. thuringiensis genes in the transformation of both monocotyledonous and 
dicotyledonous plants. To potentiate the expression of these genes, the present invention 
provides DNA constructs comprising polynucleotide segments encoding plastid targeting 
peptides positioned upstream of and in frame with the polynucleotide sequences encoding the 
desired B. thuringiensis 8-endotoxins, along with various combinations of untranslated leader 
sequences, plant functional intron sequences, and transcription termination and polyadenylation 
sequences. 

In one aspect, nucleotide sequence information provided by the invention allows for the 
preparation of relatively short DNA sequences having the ability to specifically hybridize to gene 
sequences of the selected polynucleotides disclosed herein. In these aspects, nucleic acid probes 
of an appropriate length are prepared based on a consideration of selected polypeptide sequences 
encoding Coleopteran inhibitory Cry3B 5-endotoxin polypeptides, e.g., a sequence such as that 
shown in SEQID NO:2 , SEQID NO:4, SEQID NO:6, SEQID NO:8, SEQID NO:10, and SEQID 
NO: 12. These nucleic acid probes may also be prepared based on a consideration of selected 
polynucleotide sequences encoding a plastid targeting peptide, such as those shown in SEQID 
NO:26 The ability of such nucleic acid probes to specifically hybridize to a gene sequence 
encoding a 8-endotoxin polypeptide or a plastid targeting peptide sequence lends to them 
particular utility in a variety of embodiments. Most importantly, the probes may be used in a 
variety of assays for detecting the presence of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. The sequence 
of such primers is designed using a polynucleotide of the present invention for use in detecting, 
amplifying or mutating a defined segment of a crystal protein gene from B. thuringiensis using 
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thermal amplification technology. The process may also be used to detect, amplify or mutate a 
defined segment of the polynucleotide encoding a plastid targeting peptide. Segments of genes 
related to the polynucleotides encoding the 5-endotoxin polypeptides and plastid targeting 
peptides of the present invention may also be amplified by using such primers and thermal 
amplification methods. 

To provide certain of the advantages in accordance with the present invention, a preferred 
nucleic acid sequence employed for hybridization studies or assays includes a polynucleotide 
sequences at least about 14 to 30 or so nucleotides in length complimentary to a nucleotide 
sequence encoding a crystal protein, or polynucleotide sequences at least about 14 to 30 or so 
nucleotides in length complimentary to a nucleotide sequence encoding a plastid targeting 
peptide. 

A size of at least 14 nucleotides in length helps to ensure that the fragment will be of 
sufficient length to form a duplex molecule that is both stable and selective. Molecules having 
complementary sequences over segments greater than 14 bases in length are generally preferred. 
In order to increase stability and selectivity of the hybrid, and thereby improve the quality and 
degree of specific hybrid molecules obtained, one will generally prefer to design nucleic acid 
molecules having gene-complementary sequences of 14 to 20 nucleotides, or even longer where 
desired. Such fragments may be readily prepared by, for example, directly synthesizing the 
fragment by chemical means, by application of nucleic acid reproduction technology, such as the 
PCR™ technology of U. S. Patents 4,683,195, and 4,683,202, or by excising selected DNA 
fragments from recombinant plasmids containing appropriate inserts and suitable restriction 
sites. 

The present invention also contemplates an expression vector comprising a 
polynucleotide of the present invention. Thus, in one embodiment an expression vector is an 
isolated and purified DNA molecule comprising a promoter operatively linked to a coding region 
that encodes a polypeptide of the present invention, which coding region is operatively linked to 
a transcription-terminating region, whereby the promoter drives the transcription of the coding 
region. The coding region may include a segment encoding a B. thuringiensis 8-endotoxin and a 
segment encoding a plastid target peptide. The DNA molecule comprising the expression vector 
may also contain a functional intron. As used herein, the terms "operatively linked" or "operably 
linked" mean that a promoter is connected to a coding region in such a way that the transcription 
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of that coding region is controlled and regulated by that promoter. Means for operatively linking 
a promoter to a coding region to regulate both upstream and downstream are well known in the 
art. 

Preferred plant transformation vectors include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed, e.g., by Herrera-Estrella (1983), Bevan 
(1983), Klee (1985) and Eur. Pat Appl. No. EP 0120516. 

Promoters that function in bacteria are well known in the art. Exemplary and preferred 
promoters for the B. thuringiensis crystal proteins include the sigA, sigE, and sigK gene 
promoters. Alternatively, native, mutagenized, heterologous, or recombinant promoters derived 
from Bacillus thuringiensis 8-endotoxin protein coding sequences can be used. 

Where an expression vector of the present invention is to be used to transform a plant, a 
promoter is selected that has the ability to drive expression in that particular species of plant. 
Promoters that function in different plant species are also well known in the art. Promoters 
useful in expression of polypeptide coding sequences in plants are those which are inducible, 
viral, synthetic, or constitutive as described (Poszkowski et al., 1989; Odell et al, 1985), and/or 
temporally regulated, spatially regulated, and spatio-temporally regulated (Chau et al, 1989). 
Preferred promoters include the enhanced CaMV35S promoters, and the FMV35S promoter. 
Other promoters include the POX promoter, the ScbDNA virus early promoter, and the yellow 
mottle virus promoter. 

In accordance with the present invention, expression vectors designed to specifically 
potentiate the expression of the polypeptide in the transformed plant may include certain regions 
encoding plastid targeting peptides (PTP). These regions allow for the cellular processes 
involved in transcription, translation and expression of the encoded protein to be fully exploited 
when associated with certain B. thuringiensis S-endotoxins. Such plastid targeting peptides 
function in a variety of ways, such as for example, by transferring the expressed protein to the 
cell structure in which it most effectively operates, or by transferring the expressed protein to 
areas of the cell in which cellular processes necessary for expression are concentrated. 

In the case of Cry3B, elevated expression is critical in obtaining transgenic corn with 
CRW control since the LC 50 of Cry3B against CRW is significantly higher than the LC 50 of the 
B. thuringiensis toxins currently used to control pests such as Colorado Potato Beetle in potato 
(Cry3A) or European Corn Borer in corn (Cryl Ab). 
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Increased expression is also especially valuable in that it provides additional protection 
against development of resistance via a high dose strategy (McGaughey and Whalon, 1993; 
Roush, 1994). High level expression is even further desirable as it provides sustained insect 
protection in instances where insecticidal gene expression decreases due to environmental 
conditions. Additionally and unexpectedly, corn plants transformed with vectors expressing 
Coleopteran inhibitory Cry3B or variant proteins exhibited normal growth and development. 

An example of a plastid or chloroplast targeting peptide (CTP) is a chloroplast targeting 
peptide. Chloroplast targeting peptides have been found particularly useful in the glyphosate 
resistant selectable marker system. In this system, plants transformed to express a protein 
conferring glyphosate resistance are transformed with a PTP that targets the peptide to the cell's 
cholorplasts. Glyphosate inhibits the shikimic acid pathway which leads to the biosynthesis of 
aromatic compounds including amino acids and vitamins. Specifically, glyphosate inhibits the 
conversion of phosphoenolpyruvic acid and 3-phosphoshikimic acid to 5-enolpyruvyl-3- 
phosphoshikimic acid by inhibiting the enzyme 5-enolpyruvyl-3-phosphoshikimic acid synthase 
(EPSP synthase or EPSPS). Supplemental EPSPS, conferred via insertion of a transgene 
encoding this enzyme, allows the cell to resist the effects of the gylphosate. Thus, as the 
herbicide glyphosate functions to kill the cell by interrupting aromatic amino acid biosynthesis, 
particularly in the cell's chloroplast, the CTP allows increased resistance to the herbicide by 
concentrating what glyphosate resistance enzyme the cell expresses in the chloroplast, i.e. in the 
target organelle of the cell. Exemplary herbicide resistance enzymes include EPSPS as noted 
above, glyphosate oxido-reductase (GOX) and the aro-A gene (U.S. Patent No. 4,535,060). 

CTP's can target proteins to chloroplasts and other plastids. For example, the target 
organelle may be the amyloplast. Preferred CTP's of the present invention include those targeting 
both chloroplasts as well as other plastids. Specific examples of preferred CTP's include the 
maize RUBISCO SSU protein CTP, and functionally related peptides. An exemplary CTP 
polypeptide is shown in SEQ ID NO:26. A polynucleotide sequence encoding for this CTP 
polypeptide is shown in SEQ ID NO:25. 

The expression of a gene which exists in double-stranded DNA form involves 
transcription of messenger RNA (mRNA) from the coding strand of the DNA by an RNA 
polymerase enzyme, and the subsequent processing of the mRNA primary transcript inside the 
nucleus. Transcription of DNA into mRNA is regulated by a region of DNA usually referred to 
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as the "promoter". The promoter region contains a sequence of bases that signals RNA 
polymerase to associate with the DNA and to initiate the transcription of mRNA using one of the 
DNA strands as a template to make a corresponding strand of RNA. The particular promoter 
selected should be capable of causing sufficient expression of the enzyme coding sequence to 
result in the production of an effective insecticidal amount of the B. thuringiensis protein. 

The 3' non-translated region of the chimeric plant genes of the present invention also 
contains a polyadenylation signal which functions in plants to cause the addition of adenylate 
nucleotides to the 3' end of the RNA. Examples of preferred 3' regions are (1) the 3' transcribed, 
non-translated regions containing the polyadenylation signal of Agrobacterium tumor-inducing 
(Ti) plasmid genes, such as the nopaline synthase (NOS) gene and (2) the 3' ends of plant genes 
such as the pea ssRUBISCO E9 gene (Fischhoff et al, 1987). 

A promoter is selected for its ability to direct the transformed plant cell's or transgenic 
plant's transcriptional activity to the coding region, to ensure sufficient expression of the enzyme 
coding sequence to result in the production of insecticidal amounts of the B. thuringiensis 
protein. Structural genes can be driven by a variety of promoters in plant tissues. Promoters can 
be near-constitutive (i.e. they drive transcription of the transgene in all tissue), such as the 
CaMV35S promoter, or tissue-specific or developmental ly specific promoters affecting dicots or 
monocots. Where the promoter is a near-constitutive promoter such as CaMV35S or FMV35S, 
increases in polypeptide expression are found in a variety of transformed plant tissues and most 
plant organs (e.g., callus, leaf, seed and root). Enhanced or duplicate versions of the CaMV35S 
and FMV35S promoters are particularly useful in the practice of this invention (Kay et al, 1987; 
Rogers, U. S. Patent 5,378,619). Tandemly duplicated enhancer sequences have been 
demonstrated to be of particular significance, for example, as described in Neuhaus et al. 
(Tissue-specific expression from promoter AS-1 in transgenic tobacco. Plant Cell 6: 827-834; 
1994). 

Those skilled in the art will recognize that there are a number of promoters which are 
active in plant cells, and have been described in the literature. Such promoters may be obtained 
from plants or plant viruses and include, but are not limited to, the nopaline synthase (NOS) and 
octopine synthase (OCS) promoters (which are carried on tumor-inducing plasmids of 
A. tumefaciens), the cauliflower mosaic virus (CaMV) 19S and 35S promoters, the light- 
inducible promoter from the small subunit of ribulose 1 ,5-bisphosphate carboxylase 
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(ssRUBISCO, a very abundant plant polypeptide), the rice Actl promoter, POX promoter, 
yellow mottle virus promoter, ScBV virus early promoter, the Figwort Mosaic Virus (FMV) 35S 
promoter, and the AS4 35S promoter (root enhanced expression from 35S promoter linked to 
multiple tandem as-1 sequences as in Neuhaus et al.). All of these promoters have been used to 
create various types of DNA constructs which have been expressed in plants (see e.g., McElroy 
etal, 1990, U. S. Patent 5,463,175). 

In addition, it may also be preferred to bring about expression of the B. thuringiensis 8- 
endotoxin in specific tissues of the plant by using plant integrating vectors containing a tissue- 
specific promoter. Specific target tissues may include the leaf, stem, root, tuber, seed, fruit, etc., 
and the promoter chosen should have the desired tissue and developmental specificity. Therefore, 
promoter function should be optimized by selecting a promoter with the desired tissue expression 
capabilities and approximate promoter strength and selecting a transformant which produces the 
desired insecticidal activity in the target tissues. This selection approach from the pool of 
transformants is routinely employed in expression of heterologous structural genes in plants 
since there is variation between transformants containing the same heterologous gene due to the 
site of gene insertion within the plant genome (commonly referred to as "position effect"). In 
addition to promoters which are known to cause transcription (constitutive or tissue-specific) of 
DNA in plant cells, other promoters may be identified for use in the current invention by 
screening a plant cDNA library for genes which are selectively or preferably expressed in the 
target tissues and then determine the promoter regions. 

An exemplary tissue-specific promoter is the lectin promoter, which is specific for seed 
tissue. The lectin protein in soybean seeds is encoded by a single gene {Lei) that is only 
expressed during seed maturation and accounts for about 2 to about 5% of total seed mRNA. 
The lectin gene and seed-specific promoter have been fully characterized and used to direct seed 
specific expression in transgenic tobacco plants (Vodkin et al, 1983; Lindstrom et al, 1990). 
An expression vector containing a coding region that encodes a polypeptide of interest can be 
engineered to be under control of the lectin promoter and that vector may be introduced into 
plants using, for example, a protoplast transformation method (Dhir et al, 1991). The 
expression of the polypeptide would then be directed specifically to the seeds of the transgenic 
plant. 
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A transgenic plant of the present invention produced from a plant cell transformed with a 
tissue specific promoter can be crossed with a second transgenic plant developed from a plant 
cell transformed with a different tissue specific promoter to produce a hybrid transgenic plant 
that shows the effects of transformation in more than one specific tissue. 

Other exemplary tissue-specific promoters are corn sucrose synthetase 1 (Yang et al , 
1990), corn alcohol dehydrogenase 1 (Vogel et al, 1989), corn light harvesting complex 
(Simpson, 1986), corn heat shock protein (Odell et al, 1985), pea small subunit RuBP 
carboxylase (Poulsen et al, 1986; Cashmore et al, 1983), Ti plasmid mannopine synthase 
(McBride and Summerfelt, 1989), Ti plasmid nopaline synthase (Langridge et al, 1989), petunia 
chalcone isomerase (Van Tunen et al, 1988), bean glycine rich protein 1 (Keller et al, 1989), 
CaMV 35s transcript (Odell et al, 1985) and Potato patatin (Wenzler et al, 1989). Preferred 
promoters are the cauliflower mosaic virus (CaMV 35S) promoter and the S-E9 small subunit 
RuBP carboxylase promoter. 

The promoters used in the DNA constructs of the present invention may be modified, if 
desired, to affect their control characteristics. For example, the CaMV35S promoter may be 
ligated to the portion of the ssRUBISCO gene that represses the expression of ssRUBISCO in 
the absence of light, to create a promoter which is active in leaves but not in roots. The resulting 
chimeric promoter may be used as described herein. For purposes of this description, the phrase 
"CaMV35S" promoter thus includes variations of CaMV35S promoter, e.g. , promoters derived 
by means of ligation with operator regions, random or controlled mutagenesis, etc. Furthermore, 
the promoters may be altered to contain multiple "enhancer sequences" to assist in elevating gene 
expression. Examples of such enhancer sequences have been reported by Kay et al (1987) and 
Neuhaus etal. (1994). 

The RNA produced by a DNA construct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 5' 
non-translated regions can also be obtained from viral RNAs, from suitable eukaryotic genes, or 
from a synthetic gene sequence. The present invention is not limited to constructs wherein the 
non-translated region is derived from the 5' non-translated sequence that accompanies the 
promoter sequence. As shown below, a plant gene leader sequence which is useful in the present 
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invention is the petunia heat shock protein 70 (hsp70) leader (Winter et al, 1988), the wheat 
CAB leader, or the wheat PER leader. 

An exemplary embodiment of the invention involves the plastid targeting of the B. 
thuringiensis sequence. Such plastid targeting sequences have been isolated from numerous 
nuclear encoded plant genes and have been shown to direct importation of cytoplasmically 
synthesized proteins into plastids (reviewed in Keegstra and Olsen, 1989). A variety of plastid 
targeting sequences, well known in the art, including but not limited to ADPGPP, EPSP 
synthase, or ssRUBISCO, may be utilized in practicing this invention. In alternative 
embodiments preferred, plastidic targeting sequences (peptide and nucleic acid) for 
monocotyledonous crops may consist of a genomic fragment coding containing an intronic 
sequence as well as a duplicated proteolytic cleavage site in the encoded plastidic targeting 
sequences. 

The most preferred CTP encoding nucleic acid sequence, referred to herein as zmSSU 
CTP (SEQ ID NO:25), consisting of a genomic fragment containing an intronic sequence as well 
as a duplicated proteolytic cleavage site in the encoded plastidic targeting sequences, was 
derived from plastidic targeting sequence zmSl (Russell et al, 1993). Direct translational 
fusions of zmSSU CTP peptide sequence (SEQ ID NO:26) to the amino terminus of the 
sequence has been shown to be useful in obtaining elevated levels of the polypeptide in 
transgenic maize. In-frame fusions of the zmSSU CTP nucleic acid sequence (SEQ ID NO:25) 
to a crySb gene (SEQ ID NO:l) or gene variant can be effected by ligation of an Ncol site 
engineered into the 3' (C-terminal encoding) end of the zmSSU CTP sequence to a 5' Ncol site 
engineered into the N-terminal encoding end of the cry3B or variant coding sequence. 

The preferred CTP sequence for dicotyledonous crops consists of a genomic coding 
fragment containing the chloroplast targeting peptide sequence from the EPSP synthase gene of 
Arabidopsis thaliana in which the transit peptide cleavage site of the pea ssRUBISCO CTP 
replaces the native EPSP synthase CTP cleavage site (Klee et al, 1987). 

As noted above, the 3' non-translated region of the chimeric plant genes of the present 
invention contains a polyadenylation signal which functions in plants to cause the addition of 
adenylate nucleotides to the 3' end of the RNA. Examples of preferred 3' regions are (1) the 3' 
transcribed, non-translated regions containing the polyadenylate signal of Agrobacterium tumor- 
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inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene and (2) plant genes such 
as the pea ssRUBISCO E9 gene (Fischhoff et al, 1987). 

For optimized expression in monocotyledonous plants, an intron may also be included in 
the DNA expression construct. Such an intron is typically placed near the 5'-end of the mRNA 
in an untranslated sequence. This intron could be obtained from, but not limited to, a set of 
introns consisting of the maize Heat Shock Protein (HSP) 70 intron (U. S. Patent 5,424,412; 
1995), the rice Actl intron (McElroy et al, 1990), the Adh intron 1 (Callis et al., 1987), or the 
sucrose synthase intron (Vasil et ah, 1989). As shown herein, the maize HSP70 intron (SEQID 
NO:33) and the rice actin intron (SEQID NO:32) are particularly useful in the present 
invention. 

RNA polymerase transcribes through a coding DNA sequence to a site where 
polyadenylation occurs. Typically, DNA sequences located a few hundred base pairs 
downstream of the polyadenylation site serve to terminate transcription. Those DNA sequences 
are referred to herein as transcription-termination regions. Those regions are required for 
efficient polyadenylation of transcribed messenger RNA (mRNA). 

Constructs will typically include the gene of interest along with a 3' end DNA sequence 
that acts as a signal to terminate transcription and allow for the poly-adenylation of the resultant 
mRNA. The most preferred 3' elements are contemplated to be those from the nopaline synthase 
gene of A. tumefaciens {nos 3'end) (Bevan et al, 1983), the terminator for the T7 transcript from 
the octopine synthase gene of A tumefaciens, and the 3' end of the protease inhibitor i or ii genes 
from potato or tomato. Regulatory elements such as TMV Q element (Gallie, et al., 1989), may 
further be included where desired. 

Another type of element which can regulate gene expression is the DNA sequence 
between the transcription initiation site and the start of the coding sequence, termed the 
untranslated leader sequence. The leader sequence can influence gene expression. Compilations 
of leader sequences have been made to predict optimum or sub-optimum sequences and generate 
"consensus" and preferred leader sequences (Joshi, 1987). Preferred leader sequences are 
contemplated to include those which comprise sequences predicted to direct optimum expression 
of the linked structural gene, i.e. to include a preferred consensus leader sequence which may 
increase or maintain mRNA stability and prevent inappropriate initiation of translation. The 



5/14/2009, EAST Version: 2.3.0.3 



PCT/US99/18883 



choice of such sequences will be known to those of skill in the art in light of the present 
disclosure. Sequences that are derived from genes that are highly expressed in plants, and in 
maize in particular, will be most preferred. One particularly preferred leader may be the wheat 
CAB leader (SEQID NO:31). 

Transcription enhancers or duplications of enhancers could be used to increase 
expression. These enhancers often are found 5' to the start of transcription in a promoter that 
functions in eukaryotic cells, but can often be inserted in the forward or reverse orientation 5' or 
3' to the coding sequence. Examples of enhancers include elements from the CaMV 35S 
promoter, octopine synthase genes (Ellis et al, 1987), the rice actin gene, and promoter from 
non-plant eukaryotes (e.g., yeast; Ma et al, 1988). 

The choice of which expression vector and ultimately to which promoter a polypeptide 
coding region is operatively linked depends directly on the functional properties desired, e.g., the 
location and timing of protein expression, and the host cell to be transformed. These are well 
known limitations inherent in the art of constructing recombinant DNA molecules. However, a 
vector useful in practicing the present invention is capable of directing the expression of the 
polypeptide coding region to which it is operatively linked. 

Typical vectors useful for expression of genes in higher plants are well known in the art 
and include vectors derived from the tumor-inducing (Ti) plasmid of A. tumefaciens described 
(Rogers et al, 1987). However, several other plant integrating vector systems are known to 
function in plants including pCaMVCN transfer control vector described (Fromm et al, 1985). 
pCaMVCN (available from Pharmacia, Piscataway, NJ) includes the CaMV35S promoter. 

In preferred embodiments, the vector used to express the polypeptide includes a selection 
marker that is effective in a plant cell, preferably a drug resistance selection marker. One 
preferred drug resistance marker is the gene whose expression results in kanamycin resistance; 
i.e. the chimeric gene containing the nopaline synthase promoter, Tn5 neomycin 
phosphotransferase II (nptlT) and nopaline synthase 3' non-translated region described (Rogers et 
al, 1988). 

Means for preparing expression vectors are well known in the art. Expression 
(transformation) vectors used to transform plants and methods of making those vectors are 
described in U. S. Patents 4,971,908, 4,940,835, 4,769,061 and 4,757,01 1 . Those vectors can be 
modified to include a coding sequence in accordance with the present invention. 
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A coding region that encodes a polypeptide having the ability to confer insecticidal 
activity to a cell is preferably a polynucleotide encoding a B. thuringiensis 5-endotoxin or a 
functional equivalent of such a polynucleotide. In accordance with such embodiments, a coding 
region comprising the DNA sequences of SEQID NO:l, SEQID NO:3, SEQID NO:5, 
SEQID NO:7, SEQID NO:9, and SEQID NO:l 1 are also preferred. 

Specific B. thuringiensis 8-endotoxin polypeptide-encoding ORF's contained within 
expression cassettes that have been shown to to express the B. thuringiensis 8-endotoxins at high 
levels in transformed plants. Preferred cassettes include those contained in plasmids 
pMON33709, pMON33710, pMON33722, pMON33723, pMON25096, pMON25097, 
pMON33741, and pMON33748. The expression cassettes in these plasmids are respectively 
encoded for by the sequences shown in SEQID NO:13, SEQID NO:15, SEQID NO:36, SEQID 
NO:38, SEQID NO:17, SEQID NO:19, SEQID NO:21, and SEQID NO:23. More preferably, 
plants may be successfully transformed with any expression cassettes comprising the nucleotide 
sequences of nucleotide 14 to 3431 of SEQID NO:36, 14 to 3025 of SEQID NO:38, 14 to 3431 
of SEQID NO: 17, 14 to 3020 of SEQID NO: 19, 14 to 3020 of SEQID NO:21, or 25 to 3450 of 
SEQID NO:23 (pMON33722, pMON33723, pMON25096, pMON25097, pMON33741, and 
pMON33748). Most preferably, plants may be successfully transformed with any expression 
cassettes comprising the nucleotide sequences of nucleotide 14 to 343 1 of SEQID NO: 17, 14 to 
3020 of SEQID NO: 19, 14 to 3020 of SEQID NO:21, or 25 to 3450 of SEQID NO:23 
(pMON25096, pMON25097, pMON33741, and pMON33748). 

The work described herein has identified methods of potentiating in planta expression of 
B. thuringiensis 5-endotoxins, which confer resistance to insect pathogens when incorporated 
into the genome of susceptible plants. U. S. Patent 5,500,365 describes a method for 
synthesizing plant genes to optimize the expression level of the protein for which the synthesized 
gene encodes. This method relates to the modification of the structural gene sequences of the 
exogenous transgene, to make them more "plant-like" and therefore more likely to be translated 
and expressed by the plant. A similar method for enhanced expression of transgenes in 
monocotyledonous plants is disclosed in U. S. Patent 5,689,052. Agronomic, horticultural, 
ornamental, and other economically or commercially useful plants can be made in accordance 
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with the methods described herein, to express B. thuringiensis 5-endotoxins at levels high 
enough to confer resistance to insect pathogens. 

Such plants may co-express the B. thuringiensis S-endotoxin polypeptide along with 
other antifungal, antibacterial, or antiviral pathogenesis-related peptides, polypeptides, or 
proteins; insecticidal proteins; proteins conferring herbicide resistance; and proteins involved in 
improving the quality of plant products or agronomic performance of plants. Simultaneous co- 
expression of multiple proteins in plants is advantageous in that it exploits more than one mode 
of action to control plant pathogenic damage. This can minimize the possibility of developing 
resistant pathogen strains, broaden the scope of resistance, and potentially result in a synergistic 
insecticidal effect, thereby enhancing plants ability to resist insect infestation (WO 92/17591). 

Ultimately, the most desirable DNA segments for introduction into a monocot genome 
may be homologous genes or gene families which encode a desired trait (for example, increased 
yield), and which are introduced under the control of novel promoters or enhancers, etc., or 
perhaps even homologous or tissue specific {e.g., root-collar / sheath-, whorl-, stalk-, earshank-, 
kernel- or leaf-specific) promoters or control elements. Indeed, it is envisioned that a particular 
use of the present invention may be the production of transformants comprising a transgene 
which is targeted in a tissue-specific manner. For example, insect resistant genes may be 
expressed specifically in the whorl and collar/sheath tissues which are targets for the first and 
second broods, respectively, of ECB. Likewise, it is desireable that genes encoding proteins with 
particular activity against rootworm be preferentially expressed in root tissues. 

Vectors for use in tissue-specific targeting of gene expression in transgenic plants 
typically will include tissue-specific promoters and also may include other tissue-specific control 
elements such as enhancer sequences. Promoters which direct specific or enhanced expression in 
certain plant tissues will be known to those of skill in the art in light of the present disclosure. 

It also is contemplated that tissue specific expression may be functionally accomplished 
by introducing a constitutively expressed gene (all tissues) in combination with an antisense gene 
that is expressed only in those tissues where the gene product is not desired. For example, a gene 
coding for the crystal toxin protein from B. thuringiensis may be introduced such that it is 
expressed in all tissues using the 35S promoter from Cauliflower Mosaic Virus. Alternatively, a 
rice actin promoter or a histone promoter from a dicot or monocot species also could be used for 
constitutive expression of a gene. Furthermore, it is contemplated that promoters combining 
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elements from more than one promoter may be useful. For example, U. S. Patent 5,491,288 
discloses combining a Cauliflower Mosaic Virus promoter with a histone promoter. Therefore, 
expression of an antisense transcript of a Bt 8-endotoxin gene in a maize kernel, using for 
example a zein promoter, would prevent accumulation of the 8-endotoxin in seed. Hence the 
protein encoded by the introduced gene would be present in all tissues except the kernel. It is 
specifically contemplated by the inventors that a similar strategy could be used with the instant 
invention to direct expression of a screenable or selectable marker in seed tissue. 

Alternatively, one may wish to obtain novel tissue-specific promoter sequences for use in 
accordance with the present invention. To achieve this, one may first isolate cDNA clones from 
the tissue concerned and identify those clones which are expressed specifically in that tissue, for 
example, using Northern blotting. Ideally, one would like to identify a gene that is not present in 
a high copy number, but which gene product is relatively abundant in specific tissues. The 
promoter and control elements of corresponding genomic clones may thus be localized using the 
techniques of molecular biology known to those of skill in the art. 

It is contemplated that expression of some genes in transgenic plants will be desired only 
under specified conditions. For example, it is proposed that expression of certain genes that 
confer resistance to environmentally stress factors such as drought will be desired only under 
actual stress conditions. It further is contemplated that expression of such genes throughout a 
plants development may have detrimental effects. It is known that a large number of genes exist 
that respond to the environment. For example, expression of some genes such as rbcS, encoding 
the small subunit of ribulose bisphosphate carboxylase, is regulated by light as mediated through 
phytochrome. Other genes are induced by secondary stimuli. For example, synthesis of abscisic 
acid (ABA) is induced by certain environmental factors, including but not limited to water stress. 
A number of genes have been shown to be induced by ABA (Skriver and Mundy, 1 990). It also 
is expected that expression of genes conferring resistance to insect predation would be desired 
only under conditions of actual insect infestation. Therefore, for some desired traits, inducible 
expression of genes in transgenic plants will be desired. 

It is proposed that, in some embodiments of the present invention, expression of a gene in 
a transgenic plant will be desired only in a certain time period during the development of the 
plant. Developmental timing frequently is correlated with tissue specific gene expression. For 
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example expression of zein storage proteins is initiated in the endosperm about 1 5 days after 
pollination. 

It is contemplated that the method described in this invention could be used to obtain 
substantially improved expression of a number of novel B. thuringiensis endotoxins isolated as 
described below. Identification of new Bacillus thuringiensis strains encoding crystalline 
endotoxins with insecticidal activity has been described previously (Donovan et al, 1992). 
Isolation of the B. thuringiensis endotoxin, followed by amino terminal amino acid sequencing, 
back-translation of the amino acid sequence to design an oligonucleotide probe or use of a 
related B. thuringiensis gene as a probe, followed by cloning of the gene encoding the endotoxin 
by hybridization are familiar to those skilled in the art and have been described (see e.g. , 
Donovan et al, 1992, U. S. Patent 5,264,364). Cry3Bb Bacillus thuringiensis 5-endotoxins with 
improved Coleopteran inhibitor}' activity can be achieved using the methods described in English 
etal. (W099/31248). 

A plant transformed with an expression vector of the present invention is also 
contemplated. A transgenic plant derived from such a transformed or transgenic cell is also 
contemplated. Those skilled in the art will recognize that a chimeric plant gene containing a 
structural coding sequence of the present invention can be inserted into the genome of a plant by 
methods well known in the art. Such methods for DNA transformation of plant cells include 
Agrobacterium-mediatud plant transformation, the use of liposomes, transformation using 
viruses or pollen, electroporation, protoplast transformation, gene transfer into pollen, injection 
or vacuum infiltration (Bechtold et al., Meth. Mo. Biol., 82:259-266; 1998) into reproductive 
organs, injection into immature embryos and particle bombardment. Each of these methods has 
distinct advantages and disadvantages. Thus, one particular method of introducing genes into a 
particular plant strain may not necessarily be the most effective for another plant strain, but it is 
well known which methods are useful for a particular plant strain. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. 
Four general methods for delivering a gene into cells have been described: (1) chemical methods 
(Graham and van der Eb, 1973); (2) physical methods such as microinjection (Capecchi, 1980), 
electroporation (Wong and Neumann, 1982; Fromm et ai, 1985) and the gene gun (Johnston and 
Tang, 1994; Fynan et al, 1993); (3) viral vectors (Clapp, 1993; Lu et al., 1993; Eglitis and 
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Anderson, 1988a; 1988b); and (4) receptor-mediated mechanisms (Curiel et al, 1991; 1992; 
Wagner eta!., 1992). 

An advantageous method for delivering transforming DNA segments to plant cells is 
microprojectile bombardment. In this method, particles may be coated with nucleic acids and 
delivered into cells by a propelling force. Exemplary particles include those comprised of 
tungsten, gold, platinum, and the like. Using these particles, DNA is carried through the cell wall 
and into the cytoplasm on the surface of small metal particles as described (Klein et al, 1987; 
Klein et al, 1988; Kawata et al, 1988). The metal particles penetrate through several layers of 
cells and thus allow the transformation of cells within tissue explants. 

An advantage of microprojectile bombardment, in addition to it being an effective means 
of reproducibly stably transforming plant cells, is that neither the isolation of protoplasts (Cristou 
et al, 1988) nor the susceptibility to Agrobacterium infection is required. An illustrative 
embodiment of a method for delivering DNA into plant cells by acceleration is a Biolistics 
Particle Delivery System, which can be used to propel particles coated with DNA or cells 
through a screen, such as a stainless steel or Nytex screen, onto a filter surface covered with the 
plant cultured cells in suspension. The screen disperses the particles so that they are not 
delivered to the recipient cells in large aggregates. It is believed that a screen intervening 
between the projectile apparatus and the cells to be bombarded reduces the size of projectiles 
aggregate and may contribute to a higher frequency of transformation by reducing damage 
inflicted on the recipient cells by projectiles that are too large. 

For the bombardment, cells in suspension are preferably concentrated on filters or solid 
culture medium. Alternatively, immature embryos or other target cells may be arranged on solid 
culture medium. The cells to be bombarded are positioned at an appropriate distance below the 
macroprojectile stopping plate. If desired, one or more screens are also positioned between the 
acceleration device and the cells to be bombarded. Through the use of techniques set forth here- 
in one may obtain up to 1000 or more foci of cells transiently expressing a marker gene. The 
number of cells in a focus which express the exogenous gene product 48 hours post- 
bombardment often range from 1 to 10 and average 1 to 3. 

In bombardment transformation, one may optimize the prebombardment culturing 
conditions and the bombardment parameters to yield the maximum numbers of stable 
transformants. Both the physical and biological parameters for bombardment are important in 
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this technology. Physical factors are those that involve manipulating the DNA/microprojectile 
precipitate or those that affect the flight and velocity of either the macro- or microprojectiles. 
Biological factors include all steps involved in manipulation of cells before and immediately 
after bombardment, the osmotic adjustment of target cells to help alleviate the trauma associated 
with bombardment, and also the nature of the transforming DNA, such as linearized DNA or 
intact supercoiled plasmids. It is believed that pre-bombardment manipulations are especially 
important for successful transformation of immature plant embryos. 

Accordingly, it is contemplated that one may desire to adjust various of the bombardment 
parameters in small scale studies to fully optimize the conditions. One may particularly wish to 
adjust physical parameters such as gap distance, flight distance, tissue distance, and helium 
pressure. One may also minimize the trauma reduction factors (TRFs) by modifying conditions 
which influence the physiological state of the recipient cells and which may therefore influence 
transformation and integration efficiencies. For example, the osmotic state, tissue hydration and 
the subculture stage or cell cycle of the recipient cells may be adjusted for optimum 
transformation. The execution of other routine adjustments will be known to those of skill in the 
art in light of the present disclosure. 

The methods of particle-mediated transformation is well-known to those of skill in the 
art. U. S. Patent 5,015,580 describes the transformation of soybeans using such a technique. 

Agrobacterium-mediated transfer is a widely applicable system for introducing genes into 
plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing the 
need for regeneration of an intact plant from a protoplast. The use of Agrobacterium-mediated 
plant integrating vectors to introduce DNA into plant cells is well known in the art. See, for 
example, the methods described (Fraley et al, 1985; Rogers et al, 1987). The genetic 
engineering of cotton plants using Agrobacterhim-mediated transfer is described in U. S. Patent 
5,004,863; like transformation of lettuce plants is described in U. S. Patent 5,349,124; and the 
Agrobacterium-mediated transformation of soybean is described in U. S. Patent 5,416,01 1. 
Further, the integration of the Ti-DNA is a relatively precise process resulting in few 
rearrangements. The region of DNA to be transferred is defined by the border sequences, and 
intervening DNA is usually inserted into the plant genome as described (Spielmann et al, 1986; 
Jorgensen etal, 1987). 
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Modern Agrobacterium transformation vectors are capable of replication in E. coli as 
well as Agrobacterium, allowing for convenient manipulations as described (Klee et al, 1985). 
Moreover, recent technological advances in vectors for Agrobacterium-mediated gene transfer 
have improved the arrangement of genes and restriction sites in the vectors to facilitate 
construction of vectors capable of expressing various polypeptide coding genes. The vectors 
described (Rogers et al, 1987), have convenient multi-linker regions flanked by a promoter and 
a polyadenylation site for direct expression of inserted polypeptide coding genes and are suitable 
for present purposes. In addition, Agrobacterium containing both armed and disarmed Ti genes 
can be used for the transformations. In those plant varieties where Agrobacterium-mediated 
transformation is efficient, it is the method of choice because of the facile and defined nature of 
the gene transfer. 

Agrobacterium-mediated transformation of leaf disks and other tissues such as 
cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. 
Agrobacterium-mediated transformation is most efficient in dicotyledonous plants. Few 
monocots appear to be natural hosts for Agrobacterium, although transgenic plants have been 
produced in asparagus using Agrobacterium vectors as described (Bytebier et al, 1987). Other 
monocots recently have also been transformed with Agrobacterium. Included in this group are 
corn (Ishida et al.) and rice (Cheng et al). 

A transgenic plant formed using Agrobacterium transformation methods typically 
contains a single gene on one chromosome. Such transgenic plants can be referred to as being 
heterozygous for the added gene. However, inasmuch as use of the word "heterozygous" usually 
implies the presence of a complementary gene at the same locus of the second chromosome of a 
pair of chromosomes, and there is no such gene in a plant containing one added gene as here, it is 
believed that a more accurate name for such a plant is an independent segregant, because the 
added, exogenous gene segregates independently during mitosis and meiosis. 

An independent segregant may be preferred when the plant is commercialized as a 
hybrid, such as corn. In this case, an independent segregant containing the gene is crossed with 
another plant, to form a hybrid plant that is heterozygous for the gene of interest. 

An alternate preference is for a transgenic plant that is homozygous for the added 
structural gene; i.e. a transgenic plant that contains two added genes, one gene at the same locus 
on each chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by 
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sexually mating (selfing) an independent segregant transgenic plant that contains a single added 
gene, germinating some of the seed produced and analyzing the resulting plants produced for 
gene of interest activity and mendelian inheritance indicating homozygosity relative to a control 
(native, non-transgenic) or an independent segregant transgenic plant. 

Two different transgenic plants can be mated to produce offspring that contain two 
independently segregating added, exogenous genes. Selfing of appropriate progeny can produce 
plants that are homozygous for both added, exogenous genes that encode a polypeptide of 
interest. Back-crossing to a parental plant and out-crossing with a non-transgenic plant are also 
contemplated. 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see e.g., Potrykus et al, 1985; Lorz et al, 1985; Fromm et al, 1985; Uchimiya 
et al, 1986; Callis et al, 1987; Marcotte et al, 1988). Application of these systems to different 
plant germplasm depends upon the ability to regenerate that particular plant variety from 
protoplasts. Illustrative methods for the regeneration of cereals from protoplasts are described 
(see, e.g., Fujimura et al, 1985; Toriyama et al, 1986; Yamada et al, 1986; Abdullah et al, 
1986). To transform plant germplasm that cannot be successfully regenerated from protoplasts, 
other ways to introduce DNA into intact cells or tissues can be utilized. For example, 
regeneration of cereals from immature embryos or explants can be effected as described (Vasil, 
1988). DNA can also be introduced into plants by direct DNA transfer into pollen as described 
(Zhou et al, 1983; Hess, 1987). Expression of polypeptide coding genes can be obtained by 
injection of the DNA into reproductive organs of a plant as described (Pena et al, 1987). DNA 
can also be injected directly into the cells of immature embryos and the rehydration of desiccated 
embryos as described (Neuhaus et al, 1987; Benbrook et al, 1986). 

Unmodified bacterial genes are often poorly expressed in transgenic plant cells. Several 
reports have disclosed methods for improving expression of recombinant genes in plants ( 
Murray et al. , 1 989; Diehn et al. , 1 996; Iannacone et al , 1 997; Rouwendal et al , 1 997; Futterer 
et al, 1997; and Futterer and Hohn, 1996). These reports disclose various methods for 
engineering coding sequences to represent sequences which are more efficiently translated based 
on plant codon frequency tables, improvements in codon third base position bias, using 
recombinant sequences which avoid suspect polyadenylation or A/T rich domains or intron 
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splicing concensus sequences. While these methods for synthetic gene construction are notable, 
synthetic genes of the present invention were prepared according to the method of Brown et al. 
(US Pat. No. 5,689,052; 1997). Thus, the present invention provides a method for preparing 
synthetic plant genes express in planta a desired protein product at levels significantly higher 

5 than the wild-type genes. Briefly, according to Brown et al., the frequency of rare and semi-rare 
monocotyledonous codons in a polynucleotide sequence encoding a desired protein are reduced 
and replaced with more preferred monocotyledonous codons. Enhanced accumulation of a 
desired polypeptide encoded by a modified polynucleotide sequence in a monocotyledonous 
plant is the result of increasing the frequency of preferred codons by analyzing the coding 

10 sequence in successive six nucleotide fragments and altering the sequence based on the 

frequency of appearance of the six-mers as to the frequency of appearance of the rarest 284, 484, 
and 664 six-mers in monocotyledenous plants. Furthermore, Brown et al. disclose the enhanced 
expression of a recombinant gene by applying the method for reducing the frequency of rare 
codons with methods for reducing the occurrence of polyadenylation signals and intron splice 

is sites in the nucleotide sequence, removing self-complementary sequences in the nucleotide 
sequence and replacing such sequences with nonself-complementary nucleotides while 
maintaining a structural gene encoding the polypeptide, and reducing the frequency of 
occurrence of 5'-CG-3' dinucleotide pairs in the nucleotide sequence. These steps are performed 
sequentially and have a cumulative effect resulting in a nucleotide sequence containing a 

20 preferential utilization of the more-preferred monocotyledonous codons for monocotyledonous 
plants for a majority of the amino acids present in the desired polypeptide. 

Thus, the amount of a gene coding for a polypeptide of interest (i.e. a bacterial crystal 
protein or 5-endotoxin polypeptide or such 8-endotoxin linked to a plastid targeting peptide) can 
be increased in plants by transforming those plants using transformation methods such as those 

25 disclosed herein. 

After effecting delivery of exogenous DNA to recipient cells, the next step to obtain a 
transgenic plant generally concern identifying the transformed cells for further culturing and 
plant regeneration. As mentioned herein, in order to improve the ability to identify 
transformants, it is preferable to employ a selectable or screenable marker gene as, or in addition 
30 to, the expressible gene of interest. In this case, one would then generally assay the potentially 
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transformed cell population by exposing the cells to a selective agent or agents, or one would 
screen the cells for the desired marker gene trait. 

An exemplary embodiment of methods for identifying transformed cells involves 
exposing the transformed cultures to a selective agent, such as a metabolic inhibitor, an 
antibiotic, herbicide or the like. Cells which have been transformed and have stably integrated a 
marker gene conferring resistance to the selective agent used, will grow and divide in culture. 
Sensitive cells will not be amenable to further culturing. One example of a preferred marker 
gene encoding an EPSPS synthase which is resistant to glyphosate inhibition. When this gene is 
used as a selectable marker, the putatively transformed cell culture is treated with glyphosate. 
Upon treatment, transgenic cells will be available for further culturing while sensitive, or non- 
transformed cells, will not. This method is described in detail in U. S. Patent 5,569,834. 
Another example of a preferred selectable marker system is the nptll system by which resistance 
to the antibiotics kanamycin, neomycin, and paromomycin or related antibiotics is conferred, as 
described in U. S. Patent 5,569,834. Again, after transformation with this system transformed 
cells containing a plant expressible nptll gene will be available for further culturing upon 
treatment with kanamycin or related antibiotic, while non-transformed cells will not. Use of this 
type of a selectable marker system is described in Brown et al. (U S Patent No. 5,424,412). 
Another screenable marker which may be used is the gene coding for green fluorescent protein. 
All contemplated assays are nondestructive and transformed cells may be cultured further 
following identification. 

It is further contemplated that combinations of screenable and selectable markers will be 
useful for identification of transformed cells. In some cell or tissue types a selection agent, such 
as glyphosate or kanamycin, may either not provide enough killing activity to clearly recognize 
transformed cells or may cause substantial nonselective inhibition of transformants and non- 
transformants alike, thus causing the selection technique to not be effective. It is proposed that 
selection with a growth inhibiting compound, such as glyphosate at concentrations below those 
that cause 100% inhibition followed by screening of growing tissue for expression of a 
screenable marker gene such as kanamycin would allow one to recover transformants from cell 
or tissue types that are not amenable to selection alone. It is proposed that combinations of 
selection and screening may enable one to identify transformants in a wider variety of cell and 
tissue types. 
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The development or regeneration of plants from either single plant protoplasts or various 
explants is well known in the art (Weissbach and Weissbach, 1988). This regeneration and 
growth process typically includes the steps of selection of transformed cells, culturing those 
individualized cells through the usual stages of embryonic development through the rooted 
plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic 
rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous gene that 
encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be 
achieved by methods well known in the art such as described (Horsch et al, 1985). In this 
procedure, transformants are cultured in the presence of a selection agent and in a medium that 
induces the regeneration of shoots in the plant strain being transformed as described (Fraley et 
al, 1983). In particular, U. S. Patent 5,349,124 details the creation of genetically transformed 
lettuce cells and plants resulting therefrom which express hybrid crystal proteins conferring 
insecticidal activity against Lepidopteran larvae to such plants. This procedure typically 
produces shoots within two to four months and those shoots are then transferred to an 
appropriate root-inducing medium containing the selective agent and an antibiotic to prevent 
bacterial growth. Shoots that rooted in the presence of the selective agent to form plantlets are 
then transplanted to soil or other media to allow the production of roots. These procedures vary 
depending upon the particular plant strain employed, such variations being well known in the art. 

A transgenic plant of this invention thus has an increased amount of a coding region 
encoding a B. thuringiensis 5-endotoxin polypeptide or variant thereof or may encode such a 8- 
endotoxin linked to a plastid targeting peptide. A preferred transgenic plant is an independent 
segregant and can transmit that gene and its activity to its progeny. A more preferred transgenic 
plant is homozygous for that gene, and transmits that gene to all of its offspring on sexual 
mating. Seed from a transgenic plant may be grown in the field or greenhouse, and resulting 
sexually mature transgenic plants are self-pollinated to generate true breeding plants. The 
progeny from these plants become true breeding lines that are evaluated for increased expression 
of the transgene encoding the 8-endotoxin. 

To identify a transgenic plant expressing high levels of the 8-endotoxin of interest, it is 
necessary to screen the herbicide or antibiotic resistant transgenic, regenerated plants (Ro 
generation) for insecticidal activity and/or expression of the gene of interest. This can be 
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accomplished by various methods well known to those skilled in the art, including but not 
limited to: 1) obtaining small tissue samples from the transgenic Ro plant and directly assaying 
the tissue for activity against susceptible insects in parallel with tissue derived from a non- 
expressing, negative control plant. For example, Ro transgenic corn plants expressing B. 
thuringiensis endotoxins such as Cry3B can be identified by assaying leaf tissue or root tissue 
derived from such plants for activity against CRW; 2) analysis of protein extracts by enzyme 
linked immunoassays (ELISAs) specific for the gene of interest (Cry3B); or 3) reverse 
transcriptase thermal amplification to identify events expressing the gene of interest. 

The genes and 8-endotoxins according to the subject invention include not only the full 
length sequences disclosed herein but also fragments of these sequences, or fusion proteins, 
which retain the characteristic insecticidal activity of the sequences specifically exemplified 
herein. 

It should be apparent to a person of skill in the art that insecticidal 5-endotoxins can be 
identified and obtained through several means. The specific genes, or portions thereof, may be 
obtained from a culture depository, or constructed synthetically, for example, by use of a gene 
machine. Variations of these genes may be readily constructed using standard techniques for 
making point mutations. Also, fragments of these genes can be made using commercially 
available exonucleases or endonucleases according to standard procedures. For example, 
enzymes such as BaB 1 or site-directed mutagenesis can be used to systematically cut off 
nucleotides from the ends of these genes. Also, genes which code for active fragments may be 
obtained using a variety of other restriction enzymes. Proteases may be used to directly obtain 
active fragments of these 6-endotoxins. 

Equivalent 5-endotoxins and/or genes encoding these 5-endotoxins can also be isolated 
from Bacillus strains and/or DNA libraries using the teachings provided herein. For example, 
antibodies to the 8-endotoxins disclosed and claimed herein can be used to identify and isolate 
other 5-endotoxins from a mixture of proteins. Specifically, antibodies may be raised to the 
portions of the 5-endotoxins which are most constant and most distinct from other B. 
thuringiensis 5-endotoxins. These antibodies can then be used to specifically identify equivalent 
5-endotoxins with the characteristic insecticidal activity by immunoprecipitation, enzyme linked 
immunoassay (ELISA), or Western blotting. 
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A further method for identifying the 6-endotoxins and genes of the subject invention is 
through the use of oligonucleotide probes. These probes are nucleotide sequences having a 
detectable label. As is well known in the art, if the probe molecule and nucleic acid sample 
hybridize when together in a sample by forming hydrogen bonds between the two molecules, it 
can be reasonably assumed that the probe and sample are essentially identical or substantially 
similar or homologous at least along the length of the probe. The probe's detectable label 
provides a means for determining in a known manner whether hybridization has occurred. Such 
a probe analysis provides a rapid method for identifying insecticidal 8-endotoxin genes of the 
subject invention. 

Duplex formation and stability depend on substantial complementary between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probes of the subject invention include mutations (both single and multiple), 
deletions, insertions of the described sequences, and combinations thereof, wherein said 
mutations, insertions and deletions permit formation of stable hybrids with the target 
polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given 
polynucleotide sequence in many ways, by methods currently known to an ordinarily skilled 
artisan, and perhaps by other methods which may become known in the future. 

The potential variations in the probes listed is due, in part, to the redundancy of the 
genetic code. Because of the redundancy of the genetic code, more than one coding nucleotide 
triplet (codon) can be used for most of the amino acids used to make proteins. Therefore 
different nucleotide sequences can code for a particular amino acid. Thus, the amino acid 
sequences of the B. thuringiensis 8-endotoxins and peptides, and the plastid targeting peptides 
and the polynucleotides which code for them, can be prepared by equivalent nucleotide 
sequences encoding the same amino acid sequence of the protein or peptide. 

Site-specific mutagenesis is a technique useful in the preparation of individual peptides, 
or biologically functional equivalent proteins or peptides, through specific mutagenesis of the 
underlying DNA. The technique further provides a ready ability to prepare and test sequence 
variants, for example, incorporating one or more of the foregoing considerations, by introducing 
one or more nucleotide sequence changes into the DNA. 

In general, the technique of site-specific mutagenesis is well known in the art, as 
exemplified by various publications. As will be appreciated, the technique typically employs a 
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phage vector which exists in both a single stranded and double stranded form. Typical vectors 
useful in site-directed mutagenesis include vectors such as the Ml 3 phage or plasmids containing 
an Ml 3 origin of replication. These phage are readily commercially available and their use is 
generally well known to those skilled in the art. 

Modification and changes may be made in the structure of the peptides of the present 
invention and DNA segments which encode them and still obtain a functional molecule that 
encodes a protein or peptide with desirable characteristics. The biologically functional equivalent 
peptides, polypeptides, and proteins contemplated herein should possess about 80% or greater 
sequence similarity, preferably about 85% or greater sequence similarity, and most preferably 
about 90% or greater sequence similarity, to the sequence of, or corresponding moiety within, 
the fundamental Cry3B amino acid sequence. 

The following is a discussion based upon changing the amino acids of a protein to create 
an equivalent, or even an improved, second-generation molecule. In particular embodiments of 
the invention, mutated crystal proteins are contemplated to be useful for increasing the 
insecticidal activity of the protein, and consequently increasing the insecticidal activity and/or 
expression of the recombinant transgene in a plant cell. The amino acid changes may be 
achieved by changing the codons of the DNA sequence, according to the codons given in readily 
available amino acid codon tables. 

For example, certain amino acids may be substituted for other amino acids in a protein 
structure without appreciable loss of interactive binding capacity with structures such as, for 
example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it 
is the interactive capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of 
course, its underlying DNA coding sequence, and nevertheless obtain a protein with like 
properties. It is thus contemplated by the inventors that various changes may be made in the 
peptide sequences of the disclosed compositions, or corresponding DNA sequences which 
encode said peptides without appreciable loss of their biological utility or activity. 

In making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1 982, incorporate herein by 
reference). It is accepted that the relative hydropathic character of the amino acid contributes to 
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the secondary structure of the resultant protein, which in turn defines the interaction of the 
protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, 
antigens, and the like. 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar biological 
activity, i.e. still obtain a biological functionally equivalent protein. It is also understood in the 
art that the substitution of like amino acids can be made effectively on the basis of 
hydrophilicity. U. S. Patent 4,554,101 states that the greatest local average hydrophilicity of a 
protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a 
biological property of the protein. It is understood that an amino acid can be substituted for 
another having a similar hydrophilicity value and still obtain a biologically equivalent, and in 
particular, an immunologically equivalent protein. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and valine, leucine and isoleucine. 

Polynucleotides encoding 5-endotoxins derived from B. thuringiensis are known by those 
skilled in the art, to be poorly expressed when incorporated into the nuclear DNA of transgenic 
plants (reviewed by Diehn et al, 1996). Preferably, a nucleotide sequence encoding the 8- 
endotoxin of interest is designed essentially as described in U. S. Patent 5,500,365 and 
5,689,052. Examples of nucleotide sequences useful for expression include but are not limited 
to, cry3B (SEQIDNO:5), cry3Bbl (SEQID NO:l), cry3Bb2 (SEQIDNO:3), vll231 (SEQID 
NO:7), 11231mvl (SEQID NO:9), and 11231mv2 (SEQID NO:l 1). 

Peptides, polypeptides, and proteins biologically functionally equivalent to Cry3B 
include amino acid sequences containing conservative amino acid changes in the fundamental 
sequence shown in SEQID NO:2, SEQID NO:4, SEQID NO:8., SEQID NO: 10, and SEQID 
NO:12(Cry3Bbl,Cry3Bb2,vll231, 11231mvl, 1123 lmv2, Cry3Bb.l 1231, or Cry3Bb.ll098, 
etc). In such amino acid sequences, one or more amino acids in the fundamental sequence is 
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(are) substituted with another amino acid(s), the charge and polarity of which is similar to that of 
the native amino acid, i.e. a conservative amino acid substitution, resulting in a silent change. 

Substitutes for an amino acid within the fundamental polypeptide sequence can be 
selected from other members of the class to which the naturally occurring amino acid belongs. 
Amino acids can be divided into the following four groups: (1) acidic amino acids; (2) basic 
amino acids; (3) neutral polar amino acids; and (4) neutral non-polar amino acids. 
Representative amino acids within these various groups include, but are not limited to: (1) acidic 
(negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively 
charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar amino acids such 
as glycine, serine, threonine, cyteine, cystine, tyrosine, asparagine, and glutamine; (4) neutral 
nonpolar (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan, and methionine. 

Conservative amino acid changes within the fundamental polypeptide sequence can be 
made by substituting one amino acid within one of these groups with another amino acid within 
the same group. Biologically functional equivalents of Cry3B can have 10 or fewer conservative 
amino acid changes, more preferably seven or fewer conservative amino acid changes, and most 
preferably five or fewer conservative amino acid changes. The encoding nucleotide sequence 
(gene, plasmid DNA, cDNA, non-naturally occurring, or synthetic DNA) will thus have 
corresponding base substitutions, permitting it to encode biologically functional equivalent forms 
ofCry3B. 

The present invention provides methods and compositions for expressing Coleopteran inhibitory 
Cry3B B. thuringiensis 5-endotoxins or amino acid sequence variants thereof at unexpectedly high levels 
in transgenic plants. The disclosed methods and compositions may exploit any of the DNA constructs 
disclosed as well as any of the transformation vectors disclosed herein. The contemplated methods and 
compositions enable Cry3Bb 5-endotoxins or amino acid sequence variants thereof to be expressed in 
plants without negatively affecting the recovery of agronomic qualities of transgenic plants. The 
inventions described herein also enables expression of Cry3B 8-endotoxins and variants at levels up to 
500 times higher than that achieved by previous methods and compositions. 

The methods described here thus enables plants expressing Cry3B or variants to be used as either 
an alternative or supplement to plants expressing other Cry proteins such as a Cry3B variant, a Cry3A 
or Cry3D or variant, CryET33 and CryET34 or variants thereof, a CryET70 or variant, a 
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CryET29 or variant, a Cry6A or Cry6B or variant, a Cry8B or variant, insecticidal acyl lipid 
hydrolases, combinations of amino acid oxidases and tedanalactam synthases, and other 
insecticidal proteins such as VIP1 and VIP3 and various combinations isolated from 
Heterorhabdus, Photorhabdus, and Xenorhabdus species for both control and resistance 
management of key insect pests, including Ostrina sp, Diatraea sp,, Diabrotica, Helicoverpa sp, 
Spodoptera sp in Zea mays; Heliothis virescens , Helicoverpa sp, Pectinophora sp. in Gossypium 
hirsutum; and Anticarsia sp, Pseudoplusia sp, Epinotia sp in Glycine max. It is also contemplated that the 
methods described may be used to dramatically increase expression of B. thuringiensis 8-endotoxins 
including and related to Cry3, thus increasing its effectiveness against target pests and decreasing the 
likelihood of evolved resistance to these proteins. In one embodiment of the present invention, a Cry3 8- 
endotoxin is expressed. Target pests of this protein and their common hosts are shown below in Table 1. 

Table 1 

Target Pests Affected by Coleopteran Active (Inhibitory) Cry3B 8-Endotoxin and Common 
Plant Hosts of Those Pests 



Pests 


Hosts 


Lcptinotarsa dcccmlineata 
(Colorado Potato Beetle) 


Potato 


Diabrotica barberi 
(Northern Corn Rootworm) 


Corn 


Diabrotica undecimpunctata 
(Southern Corn Rootworm) 


Corn 


Diabrotica virgifera 
(Western Corn Rootworm) 


Corn 


Anthonomis grandis 
(Boll Weevil) 


Cotton 


Triboleum castaneum 
(Red Flour Beetle) 


Wheat 


Papilla japonica 
(Japanese Flour Beetle) 


Wheat 



Antibodies were required for studies comparing expression of various Cry3 coding sequences, so 
polyclonal serum was generated as follows. Cry3 Bt crystals were collected from a sporulated 
fermentation of Bacillus thuringiensis recombinant strain 1 1 037 expressing native Cry3Bb. Crystals were 
solubilized in 100 mM sodium carbonate buffer, pH10.5, to give a concentration of 2.7 mg protein per 
mL as measured by a colorimetric bicinchoninic acid assay (Smith et al, 1985). . A sample was diluted to 
a concentration of 0.4 mg/mL and mixed with an equal volume of Freund's complete adjuvant. A 1 
milliliter inoculum of this mixture was used for the first intradermal injection into a rabbit. A first bleed 
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was collected two weeks later. Subsequent injections of Cry3Bb protein designed to boost the immune 
titer were prepared by mixing equal volumes of 0.2 mg/mL protein with equal volumes of Freund's 
incomplete adjuvant. 1 milliliter injections were administered at four week intervals, and additional 
bleeds were obtained every two weeks. Immune serum adequate for analytical purposes was prepared 

5 from rabbit #783 after purification over a Protein A Sepharose CL-4B affinity chromatography according 
to the manufacturers' instructions (Sigma Chemical Co, St. Louis, Missouri) and concentrated to 1 
milligram of IgG protein per milliliter and stored in the dark at 4°C. A sample of this antiserum was 
conjugated to alkaline phosphatase enzyme for subsequent use in quantitative ELISA assays. 

Leaf and root samples were collected from plants expressing Cry3Bb variant proteins 1 123 1, 

10 11 084, 1 1 098, and 1 1 247. Extracts of plant samples were prepared as follows. Plant tissue, root or leaf 
parts, was harvested and weighed on a gram scale. Leaf tissue was mixed with 20 parts TBA buffer, 
weight to volume. Root tissue was mixed with 10 parts TBA buffer, weight to volume. Tissues were 
ground into an emulsion using a Wheaton™ overhead grinder and stored on ice or at -20°C. 250 
microliters of rabbit anti-Cry3Bb antiserum diluted 1:1000 in carbonate coating buffer, pH9.6, was 

15 distributed onto each well of a 96-well microtiter plate and incubated overnight at 4°C. The plate was 
then washed with PBST (3x5 min). Tissue extract samples were loaded in duplicate at 20 microliters 
per well and at varying dilutions in order to obtain a value within a standard curve established using 
Cry3Bb variant 1 123 1 . Plates were incubated overnight at 4°C, then washed with PBST three times, five 
minutes each time. 50 microliters of the rabbit anti-Cry3B alkaline phosphatase conjugated polyclonal 

20 antibody was added to each well, followed by the addition of 1 80 uL of PBST containing 1 % PVP-40 
(Sigma). After overnight incubation, plates were washed with PBST (3X5 min) and developed with 
alkaline phosphatase color development solution consisting of 20 mg para-nitrophenyl phosphate in 25 
mL diethanolamine, pH9.8, 200 uL/well). Plates were read at X405 after 1 5-20 minutes, using a quadratic 
curve fit to a protein standard curve where the optical density of the highest standard was approximately 

25 1.00. 

5.0 EXAMPLES 

The following examples are included to demonstrate preferred embodiments of the 
30 invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
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that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

5.1 Example 1 - Isolation, Characterization, and Identification of Cry3 proteins and 
genes, and Construction of Amino Acid Sequence Variants Thereof 

Means for identifying and characterizing Coleopteran toxic gene products are well 
documented in the art, and methods for isolating, characterizing and identifying the genes which 
encode such gene products are also well known in the art. In addition, the means for producing 
amino acid sequence variants of such Coleopteran toxic 5-endotoxin proteins are also well 
known. In particular, Van Rie et al. (US Patent No. 5,659,123; 1997) identify Cry3A and D 
toxins which exhibit Coleopteran inhibitory properties, and also set forth a method for 
identifying mutants which can be constructed which have reduced insecticidal activity with 
reference to the wild type protein. Van Rie et al. describe how those particular mutants can be 
further manipulated to identify amino acid sequence variant toxins which exhibit increased 
insecticidal activity with reference to the wild type protein. English et al. (WO 99/31248) 
describe other methods and compositions, in particular for Cry3B, which enable the 
identification of Cry3 encoding genes and gene products and the methods which can be used to 
construct and identify amino acid sequence variants exhibiting improved insecticidal activity 
with reference to that of the wild type Cry3 protein. Several coding sequences used herein were 
derived from those described in English et al. and the proteins produced from these coding 
sequences represent in particular the variants 1 1231 or 1 1098 as described therein. 

5.2 

Example 2. Construction of monocot plant expression vectors for the Cry3Bb variants 

Design of cry3Bb variant genes for plant expression 

For efficient expression of the Cry3Bb variants in transgenic plants, the gene encoding the 
variants must have a suitable sequence composition (Diehn et al, 1996). One example of such a 
sequence is shown for the vll231 gene (SEQID NO:7) which encodes the 11231 variant of the 
Cry3Bb protein (SEQID NO: 8) exhibiting Diabroticus activity. This gene was derived via 
mutagenesis (Kunkel, 1985) of a Cry3Bb synthetic gene (SEQID NO:5) encoding a protein 
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essentially homologous to the protein encoded by the native Cry3Bb gene (Gen Bank Accession 
Number m89794; SEQID NO:l) . The following oligonucleotides were used in the mutagenesis 
of the original Cry3Bb synthetic gene (SEQID NO:5) to create the vl 1231 gene (SEQID NO:7) 

5 Oligo # 1 : TAGGCCTCC ATCC ATGGC A A ACCCTAACA ATC (SEQID NO: 40) 

Oligo #2: TCCCATCTTCCTACTTACGACCCTGCAGAAATACGGTCCAAC.(SEQID NO:41) 
Oligo #3: GACCTCACCTACCAAACATTCGATCTTG (SEQID NO: 42) 
Oligo #4: CGAGTTCTACCGTAGGCAGCTCAAG (SEQID NO:43) 

io Construction of crv3Bb monocot plant expression vector 

To place the Cry3Bb variant gene vl 1231 in a vector suitable for expression in 
monocotyledonous plants (i.e. under control of the enhanced Cauliflower Mosaic Virus 35S 
promoter and linker to the hsp70 intron followed by a nopaline synthase polyadenylation site as 
in Brown and Santino US patent number 5,424,412; 1995), the vector pMONl 9469 was digested 

is with Ncol and EcoRI. The larger vector band of approximately 4.6 kb was isolated after 

electrophoresis of the digestion products through an agarose gel, purified, and ligated with T4 
DNA ligase to the NcoI-EcoRI fragment of approximately 2 kb containing the vl 123 1 gene 
(SEQID NO:7). The ligation mix was transformed into a useful laboratory strain of E. coli, and 
carbenicillin resistant colonies were recovered. Plasmid DNA was recovered by miniprep DNA 

20 procedures from subsequent overnight cultures of carbenicillin resistant colonies selected into 
broth containing antibiotics. This DNA was subjected to restriction endonuclease analysis with 
enzymes such as Ncol and EcoRI, NotI, and PstI to identify clones containing the vl 1231 coding 
sequence fused to the hsp70 intron under control of the enhanced CaMV35S promoter. Clones 
identified as such were designated as pMON33708. 

25 

To place the vl 123 1 gene in a vector suitable for recovery of stably transformed and 
insect resistant plants, the 3.75 kb NotI restriction fragment from pMON33708 containing the 
lysine oxidase coding sequence fused to the hsp70 intron under control of the enhanced 
CaMV35S promoter was isolated and purified after extraction from an agarose gel. This 
30 fragment was ligated with pMON30460 treated with NotI and calf intestinal alkaline 

phosphatase. pMON30460 contains the neomycin phosphotransferase coding sequence under 
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control of the CaMV35S promoter. Kanamycin resistant colonies were obtained by 
transformation of this ligation mix into E. coli and colonies containing the appropriate band were 
identified by restriction endonuclease digestion and designated as pMON33710. Restriction 
enzymes such as NotI, EcoRV, Hindlll, Ncol, EcoRI, and Bglll were used to identify the 
appropriate clones containing the NotI fragment of pMON33708 in the NotI site of pMON30460 
(i.e. pMON33710) in the orientation such that both genes are in tandem (i.e. the 3' end of the 
vl 123 1 expression cassette is linked to the 5' end of the nptll expression cassette). Expression of 
the vl 1231 protein by pMON33710 in corn protoplasts was confirmed by electroporation of 
pMON33710 covalently closed circular plasmid DNA into protoplasts followed by protein blot 
and ELISA analysis. This vector can be introduced into the genomic DNA of corn embryos by 
particle gun bombardment followed by paromomycin selection to obtain corn plants expressing 
the vl 1231 gene essentially as described in Brown and Santino US patent number 5,424,412. In 
this example, the vector was introduced into immature embryo scutella (IES) of maize via co- 
bombardment along with with a plasmid conferring hygromycin resistance, followed by 
hygromycin selection, and regeneration. Transgenic corn lines expressing the vl 123 1 protein 
were identified by ELISA analysis scoring for both the presence and amount of vl 1231 protein 
present in each extract sample. Plants were selfed and allowed to go to seed. Progeny seed were 
cured and planted to produce seedling corn plants which were subsequently tested for protection 
from Diabroticus feeding. 

In plant performance of Crv3Bb variant 1 1231 

Transformed corn plants expressing Cry3Bb variant 1 123 1 protein were challenged with western 
corn rootworm (WCR) larvae in both a seedling and 1 0 inch pot assay. The transformed 
genotype was A634, where the progeny of the R0 cross by A634 was evaluated. Observations 
included effect on larval development (weight), root damage rating (RDR), and protein 
expression. The transformation vector containing the Cry3Bb variant gene was pMON337 1 0. 
Treatments included the positive and negative iso-populations for each event and an A634 check. 

The seedling assay consisted of the following steps; i. single seeds were placed in 1 oz cups 
containing potting soil; ii. at spiking, each seedling was infested with 4 neonate larvae, and iii. 
after infestation, seedlings were incubated for 7 days at 25°C, 50% RH, and 14:10 (L:D) photo 
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period. Adequate moisture was added to the potting soil during the incubation period to 
maintain seedling vigor. 

The 10 inch pot assay consisted of the following steps; i. single seeds were placed in 10 inch 
pots containing potting soil; ii. at 14 days post planting, each pot was infested with 800 eggs 
which have been pre-incubated such that hatch would occur 5-7 days post infestation; and iii. 
after infestation, plants were incubated for 4 weeks under the same environmental conditions as 
the seedling assay. Pots were both sub & top irrigated daily. 

For the seedling assay, on day 7 plants were given a root damage rating (Table 1 .) and surviving 
larvae were weighed. Also at this time, Cry3Bb protein concentrations in the roots were 
determined by ELISA. 

Table 1. Root Damage Rating Scale for seedling assay. 
RDR 0 = no visible feeding 

1 = very light feeding 

2 = light feeding 

3 = moderate feeding 

4 = heavy feeding 

5 = very heavy feeding 

Results of the seedling assay are shown in Table 2. Plants expressing Cry3Bb protein were 
completely protected by WCR feeding, where surviving larvae within this treatment had not 
grown. Mean larval weights ranged from 2.03 - 2.73 mg for the non-expressing treatments, 
where the surviving larval average weight was 0.1 1 mg on the expressing Cry3Bb treatment. 
Root damage ratings were 3.86 and 0.33 for the non-expressing and expressing iso-populations, 
respectively. Larval survival ranged from 75 - 85 % for the negative and check treatments, 
where only 25 % of the larvae survived on the Cry3Bb treatment. 
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Table 2. Effect of Cry3Bb expressing plants on WCR larvae in a seedling assay. 







Plants 




Larvae 








Root 




% 


Mean+SD 


Event Treatment 


N 


(ppm) RDR+SD 


N 


Surv 


Wt.(mg) 


16 Negative 


7 


0.0 3.86±0.65 


21 


75 


2.73±1.67 


16 Positive 


3 


29.01 0.33+0.45 


3 


25 


0.11+0.07 


A634 Check 


4 


0.0 -- 


13 


81 


2.03±0.83 



For the 1 0 inch pot assay, at 4 weeks post infestation plant height was recorded and a root 
damage rating was given (Iowa 1-6 scale; Hills, T.M. and D.C. Peters. 1971 ; A method of 
evaluating post planting insecticide treatments for control of western corn rootworm larvae. 
Journal of Economic Entomology 64: 764-765.). 



Results of the 10 inch pot assay are shown in Table 3. Plants expressing Cry3Bb protein had 
significantly less feeding damage and were taller than the non-expressing plants. Event 16, the 
higher of the two expressing events provided nearly complete control. The negative treatments 
had very high root damage ratings indicating very high insect pressure. The positive mean root 
damage ratings were 3.4 and 2.2 for event 6 & 16, respectively. Mean RDR for the negative 
treatment was 5.0 & 5.6. 

Table 3. Effect of Cry3Bb expressed in corn in controlling WCR larval feeding in a 10 inch pot 
assay. 







Root 




Plant 


Event Treatment 


N 


(ppm) RDR+SD 


Height (cm) 


6 Negative 


7 


0.0 


5.0±1.41 


49.7+18.72 


6 Positive 


5 


7.0 


3.4+1.14 


73.9+8.67 


16 Negative 


5 


0.0 


5.6±0.89 


61.2±7.75 


16 Positive 


5 


55.0 


2.2±0.84 


83.8±27.15 
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In summary, corn plants expressing Cry3Bb protein have a significant biological effect on WCR 
larval development as seen in the seedling assay. When challenged with very high infestation 
levels, plants expressing the Cry3Bb protein were protected from WCR larval feeding damage as 
illustrated in the 10 inch pot assay. 

Example 3 - Increased Expression of a Cry3Bb protein in transgenic maize 

Expression of a Cry3Bb protein was compared in corn plants transformed with standard 
or preferredCry3Bb expression vectors. Plants transformed with the improved vectors 
consistently demonstrated significantly higher levels of expression of Cry3Bb when compared to 
plants transformed with the standard Cry3Bb vectors. A standard Cry3Bb plant expression 
vector pMON33710 contains an expression cassette composed of an enhanced CaMV35S 
promoter sequence (P-CaMV.35S, SEQID NO:29), a Zea mays Hsp70 intron sequence (I- 
Zm.Hsp70, SEQID NO:33), a non-naturally occurring sequence encoding Cry3Bb variant 
protein vl 1231 (Bt.cry3Bb.vl 1231, SEQID NO: 7), and a nopaline synthase transcription 
termination and polyadenylation sequence (T-AGRtu.nos, SEQID NO:34). Another standard 
Cry3Bb plant expression vector pMON33709 contains an expression cassette composed of an 
enhanced CaMV35S promoter sequence (P-CaMV.35S, SEQID NO:29), a Zea mays Hsp70 
intron sequence (I-Zm.Hsp70, SEQID NO:33), a Zea mays CTP encoding sequence (TS- 
Zm.rbcl, SEQID NO:25), a non-naturally occurring sequence encoding Cry3Bb variant protein 
vl 1231 (Bt.cry3Bb.vl 1231, SEQID NO:7), and a nopaline synthase transcription termination 
and polyadenylation sequence (T-AGRtu.nos, SEQID NO:34). The plant expression vector 
pMON25097 is improved compared to pMON33710 as judged by Cry3Bb expression levels in 
planta, and contains an expression cassette comprising a non-naturally occurring CaMV35S 
AS4 promoter sequence (P-CaMV.AS4, SEQID NO:30), a wheat chlorophyll A/B binding 
protein untranslated leader sequence (L-Ta.hcbl, SEQID NO:3 1), a rice actin intron sequence (I- 
Os.Actl, SEQID NO:32), and a non-naturally occurring sequence encoding Cry3Bb variant 
protein 1 123 lmvl (1 1098) (Bt.cry3Bb.l 1231mvl, SEQID NO:9) linked to a wheat heat shock 
Hspl7 transcription termination and polyadenylation sequence (T-Ta.Hspl7, SEQID NO:35). 
Another preferred vector is pMON25096, which contains an expression cassette (SEQID 
NO: 17) comprising a non-naturally occurring CaMV35S AS4 promoter sequence (P- 
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CaMV.AS4, SEQID NO:30), a wheat chlorophyll A/B binding protein untranslated leader 
sequence (L-Ta.hcbl, SEQID NO:31), a rice actin intron sequence (I-Os.Actl, SEQID NO:32), a 
Zea mays CTP encoding sequence (TS-Zm.rbcl, SEQID NO:25), and a non-naturally occurring 
sequence encoding Cry3Bb variant protein 1 1231mvl (Bt.cry3Bb.l 1231mvl, SEQID NO:9) 
linked to a wheat heat shock Hspl7 transcription termination and polyadenylation sequence (T- 
Ta.Hspl7, SEQID NO:35). All vectors contain an identical cassette linked to the Cry3Bb 
expression cassette which confers paromomycin resistance to transformed plant tissue. This 
resistance cassette consists of an enhanced CaMV35S promoter sequence, and a neomycin 
phosphotransferase coding sequence linked to a nopaline synthase transcription termination and 
polyadenylation sequence. A summary of the standard and improved vectors is presented in 
Table 4. Transgenic corn plants resistant to paromomycin were derived essentially as described 
in U. S. Patent 5,424,412 (1995). 



Table 4 

Plant Expression Vector Summary 



Vector 


Expression Cassette 


Selection 
Cassette 


pMON33709 


35S/HSP70/ZmRBC/vll231/NOS 


e35S/nptII /nos 


pMON33710 


e35S/HSP70/11231v/nos 


e35S/nptII /nos 


pMON33722 


AS4/TaCAB/OsActl/ZmRBC/vll231/t 
ahspl7 


e35S/nptII/nos 


pMON33723 


AS4//TaCAB/OsActl/ 
vll231/tahspl7 


e35S/nptII /nos 


pMON25096 


AS4/TaCAB/OsActl/ZmRBC/11231mvl 
/tahspl7 


e35S/nptII /nos 


pMON25097 


AS4/TaCAB/OsActl/11231mvl/tahsp 

17 


e35S/nptII /nos 


pMON33741 


AS4/TaCAB/OsActl/11231mv2/tahsp 

17 


e35S/nptII /nos 


pMON33748 


e35S/TaCAB/OsActl/11231mv2/tahs 
P 17 


e35S/nptII /nos 
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Maize leaf protoplasts were electroporated with standard vectors (pMON33709 or 
pMON33710) or improved vectors (pMON33722, pMON33723, pMON25096, pMON25097, 
pMON33741) as described ( Sheen. Plant Cell 2:1027-1038. 1990) and transient expression of 
Cry3Bb variant proteins was compared by ELISA and Western Blot analysis methods. The 
ELISA used a rabbit anti-Cry3B chromatography purified IgG capture antibody raised against 
Cry3B 1 1231, a sample of that antibody conjugated to alkaline phosphatase as the secondary 
detecting antibody, and a purified Cry3Bb native protein as a standard. Comparison of the ratio 
of Cry3Bb to neomycin phosphotransferase (Npt II) expression levels by ELISA indicated that 
approximately two-fold increases in the normalized expression levels of Cry3Bb variant protein 
1 1231 were obtained with improved vectors pMON33723 and pMON33722 relative to the 
standard vectors pMON33710 and pMON33709, respectively.(Expt. 1, Table 5). , Differences in 
Cry3Bb expression are directly ascribed to the improved expression cassette in the improved 
vectors rather than to differences in protoplast electroporation efficiency since expression of 
Cry3Bb protein is normalized to Npt II produced by the identical linked nptll gene present in all 
vectors. The most preferred improved vectors such as pMON25096, pMON25097, and 
pMON33741 expressed approximately 10-fold higher normalized levels of Cry3Bb and variant 
Cry3Bb protein than the preferred improved vectors such as pMON33722 or pMON33723 
(Table 5, Expt. 2, 3). Finally, the equally preferred pMON33741 and pMON25097 vectors 
yielded roughly equivalent normalized Cry3Bb expression (Table 5, Expt. 4) 
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Table 5 



Transient Cry3Bb and Cry3BbVariant Expression 
in Corn Leaf Protoplasts 
(normalized to Nptll expression) 



Expt. 1 


pMON33710 
5.79 


pMON33723 
12.3 




pMON33709 
2.7 


pMON33722 
7.7 


Expt. 2 


pMON33722 
1.9 


pMON25096 
26.2 




pMON33723 
3.7 


pMON25097 
37.5 


Expt.3 


pMON33723 
30 


pMON33741 
319 


Expt. 4 


pMON33741 
20 


pMON25097 
25 



Since the improved expression cassette in pMON25097 encodes the Cry3Bb 1 1231mvl 
(1 1098) variant toxin, and the standard cassette in pMON33710 encodes the Cry3Bb vl 123 1 
variant which differ by a single amino acid, the intrinsic immunoreactivity of the two proteins in 
the ELISA assay was compared. Subsequent ELISA experiments with Cry3Bb vl 123 1 and 
1 123 lmvl (1 1098) variant proteins produced in and purified from B. thuringiensis indicate that 
the two proteins have similar levels of immunoreactivity. Consequently, the observed increase in 
levels of Cry3Bb 1 123 lmvl (1 1098) protein produced from the expression cassette in 
pMON25097 is due to increased expression levels rather than a difference in immunoreactivity. 
Protein blot analyses confirm that the increased level of cross reactive material produced in 
maize protoplasts from the improved Cry3Bb expression cassette in pMON25097 were due to 
increased accumulation of an approximately 60,000 Mr protein immunoreactive with Cry3B 
antiserum that also co-migrates with Cry3Bb variant 1 1231 protein produced in a recombinant 
cry- B. thuringiensis strain from pEG7174. Equally preferred and improved Cry3Bb variant 
protein expression cassettes in pMON33741 and pMON33748 that encode Cry3Bb.l 1231 also 
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exhibit increased expression levels of Cry3Bb relative to expression observed from the standard 
cassette in pMON33710. These results confirm that expression differences are due to the 
improved compositions disclosed herein rather than to differences in the intrinsic 
immunoreactivity of the different variants. 

Root tissue from transgenic plants in the R 0 stage independently obtained after 
transformation with an improved vectors (pMON33723, 25097, ,) or with a standard vector 
(pMON 33710) was subjected to quantitative analysis of Cry3Bb protein levels by a quantitative 
ELISA assay. Comparison of Cry3Bb or Cry3Bb protein variant expression levels in improved 
and standard vector transformed corn plants show that Cry3Bb.l 1231 variant expression does 
not exceed 50 ppm in the standard pMON33710 transgenics while Cry3Bb.l 1098 (1 1231mvl) 
expression in the improved pMON25097 transgenics is frequently higher than 50 ppm (Table 6). 
Protein blot analyses confirm that the increased level of cross reactive material produced by 
pMON25097 (improved) were due to increased accumulation of an approximately Mr 60,000 
protein that migrates with Cry3Bbl standard from B. thuringiensis . Other improved Cry3Bb 
protein variant expression cassettes found in pMON33741 and 33748 also consistently yield 
select independently transformed events (ITE's) with Cry3Bb protein variant levels greater than 
100 PPM whereas the standard vectors have never given rise to ITE's with greater than 50 PPM 
of Cry3Bb protein variant (Table 7). High level expression is evident in both the H99 and A634 
maize genotypes, indicating that the compositions disclosed herein have broad utility to many 
varieties of commercially cultivated maize. Such select high expressing Cry3 protein variant 
lines obtained with the vectors described herein are expected to be especially advantageous in 
conferring high levels of protection to insect feeding damage and in reducing the incidence of 
insect resistance to Cry3 insecticidal proteins. 
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Table 6 

Comparison of Cry3Bb Expression in Ro Corn Transformed with Standard and Improved Cry3Bb Protein 
Variant Expression Cassettes 



Cry3B Expression Level (ppm) 



Vector 
(genotype) 


Total 
Events 


5-10 ppm 


10-50 ppm 


50-100 ppm 


100-200 ppm 


>200 ppm 


L25097 

A634 
H99 


45 
589 


3 

32 


7 

36 


5 


3 


3 
5 


L33710 

A634 
H99 


22 
336 


2 
13 


2 
15 








L33723 

A634 
H99 


0 

67 


6 


9 









5 Table 7 

Cry3Bb Expression in Ro Corn Transformed with Improved Cry3Bb Protein 
Variant Expression Cassettes 



Cry3B Expression Level (ppm) 

Vector Total 5-10 ppm 10-50 ppm 50-100 ppm 100-200 ppm >200 ppm 
Events 



L25097 














A634 


112 


7 


4 


5 


1 


4 


H99 


45 


1 


4 


2 






L33741 














H99 


108 


11 


5 


2 




4 


L33748 














A634 


82 


1 


11 


2 


2 


1 


H99 


209 


23 


13 


3 


3 


11 



Progeny derived from corn plants transformed with both the standard (pMON33709 and 
pMON33710) and preferred (pMON25096, 25097, 33722, 33723, 33726, 33741, and 33748) 
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cassettes expressing 10 ppm or more of Cry3Bb protein were further tested for resistance to Corn 
Rootworm (CRW) feeding damage in greenhouse or growth chamber based bioassays as 
previously described (English et al., WO 99/31248). Corn Rootworm resistant transgenic corn 
plants were obtained from essentially all of the preferred vectors (Table 8). For example, the 
improved pMON25096 vector was used to generate 89 independently transformed events 
(ITE's), 14 independent pMON25096 Fi progeny lines expressing 10 ppm or more of 
Cry3Bband 7 Fi progeny lines displaying significant levels of CRW resistance (an RDR rating > 
3.5 on a rating scale of 0-6). In contrast, not a single event with a RDR rating < 3.5 was 
obtained from 12 of the standard pMON33710 cassette F] progeny lines expressing 10 PPM or 
more of Cry3Bb protein variant. Failure to obtain CRW resistant lines with either of the 
standard vectors (pMON33709 or pMON33710) was not due to insufficient numbers of ITE's as 
over 300 ITE's from each of these two vectors were generated and screened for CRW resistant 
Fi progeny. Far fewer ITE's were generated with preferred vectors such as pMON33722, 
pMON33723, and pMON25096, yet all ultimately gave rise to CRW resistant Fi progeny lines . 

Table 8 

Numbers of CRW resistant independent transformation events obtained with the standard 



and improved Cry3Bb Protein Variant expression cassettes 



Expressi 
on cassette 


Genotyp 
e 


Total 
Number of ITE's 


Number 
of ITE's Tested 


Number 
and Percent of 
ITEs with 
RDR<3.5 


L33709 


H99 


318 


11 


0 


L33710 


H99 


336 


10 


0 




A634 


22 


2 


0 


L25096 


H99 


52 


4 


2 (50%) 




A634 


37 


10 


5 (50%) 
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L25097 


H99 


634 


17 


10 




A634 


157 


18 


(59%) 
8 (44%) 


L33722 


H99 


107 


10 


6 (60%) 


L33723 


H99 


93 


7 


3 (43%) 


L33726 


H99 


65 


6 


5 (83%) 




A634 


10 


0 




L33727 


H99 
A634 


86 
1 


0 

1 


0 


33736A 


H99 


3 


3 


2 (67%) 


BI 










L33741 


H99 


108 


1 


0 


L33748 


H99 


223 


6 


3 (50%) 




A634 


82 


7 


4 (57%) 


L33749 


H99 


73 


14 


13 


ABI 








(93%) 



In examples provided herein, experimental evidence that substantially equivalent 
compositions based on the improvements disclosed herein yield equivalent improvements in 
performance relative to the previously disclosed standards. More specifically, we demonstrate 
that improved compositions encoding both the Cry3Bb. 1 1098 and Cry3Bb. 1 1231 variants both 
yield equivalently improved performance relative to the previously disclosed standard 
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compositions encoding Cry3Bb. 1 1 23 1 . It thus follows that use of other Cry3B variants with 
specific biological activities that are greater than or equal to Cry3Bb.l 1098 or Cry3Bb.l 1231 is 
contemplated by and within the scope of this invention. For example, improved vector 
compositions encoding Cry3Bb variants include 1 123 1, 1 1084, 1 1098, 1 1247, and others as set 
forth in English et al., US Application Serial No.'s 08/993,170, 08/993,722, 08/993,755, and 
08/996,441, all filed Dec. 18, 1997 can be derived from pMON25095 using standard 
mutagenesis procedures in a manner essentially equivalent to the construction of pMON33740. 

5.4 

Example 4- Preferred Expression Cassettes Confer Resistance to CRW damage in Field 
Tests 

Corn plants genetically modified to express Cry3Bb protein variants derived from the preferred 
vectors pMON33722, pMON33723, pMON25096, and pMON25097 were evaluated in the field 
for control of western corn rootworm, Diabrotica vergifera vergifera LeConle (WCR). None of 
the corn plants transformed with the standard vectors were advanced to field testing as none 
displayed adequate Corn Rootworm control in greenhouse tests (Example 3. Table 8). The 
efficacy trials were held at a Monsanto research farm in Jerseyville, Illinois and at the Northern 
Grain Insects Research Laboratory, USDA ARS research station in Brookings, South Dakota. 
These trials serve to evaluate performance of the preferred cassettes in the field under heavy 
insect pressure and to compare their performance to the current commercially available 
insecticides . 

Seventeen independent transformation events (ITE) were selected for field evaluation based on 
greenhouse performance. The amount of seed available for the field evaluation varied for each 
ITE. Of these 17 events, only seven were planted at the Brookings research station. The field 
design for the Brookings' location was a randomized complete block (RCB) with 2 replications, 
where each plot was a single row containing a maximum of 30 plants. All 17 ITE's were planted 
at the Jerseyville location, where the design was a RCB with a maximum of 4 replications, 1 row 
plots each, where the number of replications depended on the seed available from each ITE. 
Because of this, the number of replications at Jerseyville ranged from two to four. Additional 
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treatments included an untreated check (nontransgenic corn) and commercial insecticides, 
including Counter®, Lorsban®, and Force®. The insecticide treatments were only at the 
Jerseyville location. The insecticides were applied as an eight inch band at planting using the 
recommended rates. 

Planting dates where May 28 th and June 3 rd for the Jerseyville & Brookings, respectively. The 
study was performed as follows; plots were infested with CRW eggs at planting with 1,600 eggs 
per foot of row, approximately 800 eggs per plant. At the VI - V2 plant growth stage, plants 
were analyzed for presence of the Cry3Bb protein variant expression using an ELISA. Plants 
negative for the gene were culled from the plot. 

At the end of the CRW larval feeding stage, when maximum damage would have occurred, all 
remaining plants in each plot were evaluated for root feeding injury using a 1 - 6 root damage 
rating (RDR) scale described by Hills and Peters (1971). The RDR scale is as follows; 

Root Damage Rating: 

1. No feeding scars 

2. Visible feeding scars, but no roots pruned to within 4 cm of the stalk 

3. One or more nodal roots pruned to within 4 cm of the stalk, but less than one nodes worth of 
roots 

4. One node worth of pruned roots 

5. Two nodes worth of pruned roots 

6. Three or more nodes worth of pruned roots 

On July 25* and August 3 rd the field trials were evaluated at Jerseyville and Brookings, 
respectively. The average RDR's for all treatments are illustrated in Table 9. Of the seventeen 
ITE's evaluated, 16 ITE's controlled CRW feeding, < 3.0 RDR. Two of the three chemical 
standards had a RDR less than 3.0. Force® had a root damage rating of 3.2. Except for one ITE, 
WCR20, all treatments were significantly better than the checks (p < .01) but did not differ 
significantly from each other. Figure one illustrates the difference in larval feeding damage 
between a transgenic CRW resistant plant and an untreated check. 

Even though the ITE's did not differ significantly from the chemical standards with respect to 
root damage rating, the amount of feeding injury observed on roots from the insecticide 
treatments were greater than the roots expressing Monsanto 's proprietary gene. The lack of 
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difference between root damage rating is an artifact of the root rating scale, where this scale is 
based on "pruned" roots. Hills and Peters describe a pruned root as being less than 4 cm in 
length due to CRW feeding. Therefore, root masses without a "pruned" root but visible feeding 
scares are given a rating of 2. Roots outside of the zone of protection from the insecticide 
5 treatments had many more feeding scars and in most cases the root tips were destroyed as 
compared to the ITE's. Unlike the insecticide treatments, the transgenic plants express the CRW 
resistant gene throughout the entire root mass. But because the mechanism for control of the 
transgenic plant is orally mediated, a minimum amount of feeding is required to control any 
further injury by the CRW larvae. This minimal feeding requirement resulted in a RDR of 2. 

10 

In summary, corn plants expressing Cry3Bb protein variants were fully protected from CRW 
larval feeding. This level of protection eliminates the need for an insecticide treatment. 
Insecticides, including organophosphates, carbamates and pyrethroids are incorporated into the 
soil on over 16 million corn acres annually to control CRW. CRW resistance technology has the 

is potential to significantly reduce the current exposure level of these insecticides to the 
environment. The benefits of shifting away from soil insecticides to a transgenic approach are 
impressive and include a reduction in potential human health and safety risks, reduced direct 
impacts on nontarget organisms, reduced contamination of surface and ground water supplies, 
decreased pesticide container disposal problems, and general compatibility with other pest 

20 management and agronomic programs. 

Table 9. 

Corn rootworm root feeding damage (RDR) means for corn independent transformation events 
containing Monsanto's proprietary CRW resistant gene. 

25 

Root Damage Rating (RDR) 



Treatment 


Jerseyville 


Brookings 


Average (RDR) 


pMON 25097-1 


2.3 


1.9 


2.1 


pMON 33722-1 


2.6 


2.3 


2.5 


pMON 33723-1 


2.6 


2.9 


2.8 


pMON 33723-2 


2.6 


2.0 


2.3 


pMON33722-2 


2.5 


1.9 


2.2 


pMON 25096-1 


2.8 


2.5 


2.7 


pMON 25097-2 


2.5 


2.3 


2.4 
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pMON 25096-2 


2.4 


n/a 


2.4 


pMON 25097-3 


2.6 


n/a 


2.6 


pMON 25096-3 


2.2 


n/a 


2.2 


pMON 25097-4 


2.2 


n/a 


2.2 


pMON 25096-4 


2.6 


n/a 


2.6 


pMON 33723-3 


2.5 


n/a 


2.5 


pMON 25097-5 


3.0 


n/a 


3.0 


pMON 25097-6 


4.0 


n/a 


4.0 


pMON 25097-7 


2.2 


n/a 


2.2 


pMON 33722-3 


2.6 


n/a 


2.6 


COUNTER ® 


2.4 


n/a 


2.4 


LORSBAN ® 


2.4 


n/a 


2.4 


FORCE ® 


3.2 


n/a 




CHECK 


4.1 


4.1 


4.1 



5.5 

Example 5 - Transformation of Tobacco Chloroplast with a cry3B gene 

Recombinant plants can be produced in which only the mitochondrial or chloroplast 
DNA has been altered to incorporate the molecules envisioned in this application. Promoters 
which function in chloroplasts have been known in the art (Hanley-Bowden et al., Trends in 
Biochemical Sciences 12:67-70, 1987). Methods and compositions for obtaining cells 
containing chloroplasts into which heterologous DNA has been inserted have been described, for 
example by Daniell et al. (U.S. Pat. No. 5,693,507; 1997) and Maliga et al. (U.S. Pat. No. 
5,451,513; 1995). A vector can be constructed which contains an expression cassette from 
which a Cry3B protein could be produced. A cassette could contain a chloroplast operable 
promoter sequence driving expression of a cry3B crystal protein gene, constructed in much the 
same manner as other polynucleotides herein, using thermal amplification methodologies, 
restriction endonuclease digestion, and ligation etc. A chloroplast expressible gene would 
provide a promoter and a 5' untranslated region from a heterologous gene or chloroplast gene 
such as psbA, which would provide for transcription and translation of a DNA sequence 
encoding a Cry3B protein in the chloroplast; a DNA sequence encoding Cry3B protein; and a 
transcriptional and translational termination region such as a 3' inverted repeat region of a 
chloroplast gene that could stabilize an expressed cry3B mRNA. Expression from within the 
chloroplast would enhance cry3B gene product accumulation. A host cell containing 
chloroplasts or plastids can be transformed with the expression cassette and then the resulting 
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cell containing the transformed chloroplasts can be grown to express the Cry3B protein. A 
cassette may also include an antibiotic, herbicide tolerance, or other selectable marker gene in 
addition to the cry3B gene. The expression cassette may be flanked by DNA sequences obtained 
from a chloroplast DNA which would facilitate stable integration of the expression cassette into 
the chloroplast genome, particularly by homologous recombination. Alternatively, the 
expression cassette may not integrate, but by including an origin of replication obtained from a 
chloroplast DNA, would be capable of providing for replication of the heterologous cry3B gene 
in the chloroplast. Plants can be generated from cells containing transformed chloroplasts and 
can then be grown to produce seeds, from which additional plants can be generated. Such 
transformation methods are advantageous over nuclear genome transformation, in particular 
where chloroplast transformation is effected by integration into the chloroplast genome, because 
chloroplast genes in general are maternally inherited. This provides environmentally "safer" 
transgenic plants, virtually eliminating the possibility of escapes into the environment. 
Furthermore, chloroplasts can be transformed multiple times to produce functional chloroplast 
genomes which express multiple desired recombinant proteins, whereas nuclear genomic 
transformation has been shown to be rather limited when multiple genes are desired. 
Segregational events are thus avoided using chloroplast or plastid transformation. Unlike plant 
nuclear genome expression, expression in chloroplasts or plastids can be initiated from only one 
promoter and continue through a polycistronic region to produce multiple peptides from a single 
mRNA. 

The expression cassette would be produced in much the same way that other plant 
transformation vectors are constructed. Plant chloroplast operable DNA sequences can be 
inserted into a bacterial plasmid and linked to DNA sequences expressing desired gene products, 
such as Cry3B proteins, so that Cry3B protein is produced within the chloroplast, obviating the 
requirement for nuclear gene regulation, capping, splicing, or polyadenylation of nuclear 
regulated genes, or chloroplast or plastid targeting sequences. An expression cassette comprising 
a cry3B gene, which is either synthetically constructed or a native gene derived directly from a 
B. thuringiensis genome or a B. thuringiensis episomal element, would be inserted into a 
restriction site in a vector constructed for the purpose of chloroplast or plastid transformation. 
The cassette would be flanked upstream by a chloroplast or plastid functional promoter and 
downstream by a chloroplast or plastid functional transcription and translation termination 
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sequence. The resulting cassette would be incorporated into the chloroplast or plastid genome 
using well known homologous recombination methods. 

Alternatively, chloroplast or plastid transformation could be obtained by using an 
autonomously replicating plasmid or other vector capable of propagation within the chloroplast 
or plastid. One means of effectuating this method would be to utilize a portion of the chloroplast 
or plastid genome required for chloroplast or plastid replication initiation as a means for 
maintaining the plasmid or vector in the transformed chloroplast or plastid. A sequence enabling 
stable replication of a chloroplast or plastid epigenetic element would easily be identified from 
random cloning of a chloroplast or plastid genome into a standard bacterial vector which also 
contains a chloroplast or plastid selectable marker gene, followed by transformation of 
chloroplasts or plastids and selection for transformed cells on an appropriate selection medium. 
Introduction of an expression cassette as described herein into a chloroplast or plastid replicable 
epigenetic element would thus provide an effective means for localizing a Cry3B B. 
thuringiensis 8-endotoxin to the chloroplast or plastid. 

5.6 

Example 6: Targeting Cry3Bb or Variant Cry3Bb Protein to Plastids 

Improved expression by targeting recombinant insecticidal protein to the 
chloroplast may result in tissues which are light exposed and which accumulate mature 
chloroplasts as a result. Improving expression in leaf tissue to inhibit leaf-feeding pests 
susceptible to the insecticidal protein could be advantageous. To test this, two plasmids, 
pMON33709 and pMON33710 were constructed which were isogenic with respect to all 
elements with the exception of a plastid or chloroplast targeting sequence linked in frame to the 
insecticidal Cry3Bb improved variant in pMON33709. R 0 corn plants were recovered and were 
shown to contain and express the transgene by ELISA. Six pMON33709 lines and sixteen 
pMON33710 lines were recovered which expressed the transgene in both the root and the leaves. 
Leaf and root tissue were recovered and analyzed for the presence and amount of Cry3Bb variant 
protein, measured in parts per million. The results are shown in Table 10. 

Table 10. 
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Comparison of Non-Targeted and Plastid Targeted Leaf vs Root Expression of Cry3Bb Variant 



vll231 in R 0 Corn Transformation Events 



R0# 


Event # 


Construct 


Tissue 


ppm 11231(ug/g tissue) 


R053608 


2027-05-01 


L33709 


Leaf 


14.69 


R053608 


2027-05-01 


L33709 


Root 


3.97 


R053621 


2028-06-06 


L33709 


Leaf 


22.65 


R053621 


2028-06-06 


L33709 


Root 


0.10 


R053643 


2029-03-09 


L33709 


Leaf 


1.05 


R053643 


2029-03-09 


L33709 


Root 


3.83 


R053675 


2028-03-06 


L33709 




7.13 


R053675 


2028-03-06 


L33709 


Root 


2.23 


R053688 


2028-04-02 


L33709 


— L — f — 




R053688 


2028-04-02 


L33709 


Root 


9.83 


R053690 


2028-04-02 


L33709 


Leaf 


98.69 


R053690 


2028-04-02 


L33709 


Root 


6.38 


R053708 


2027-01-02 








R053708 


2027-01-02 


L33710 


Root 


4.94 


R053781 


2028-02-19 




— L" — f — 




R053781 


2028-02-19 


L33710 


Root 


4.72 


R053785 


2027-04-06 


L33710 


Leaf 


21 .97 


R053785 


2027-04-06 


L33710 


Root 


7.20 


R053799 


2028-01-16 


L33710 


Leaf 


12.41 


R053799 


2028-01-16 


L33710 


Root 


6.19 


R053800 


2028-01-16 


L33710 


Leaf 


5.69 


R053800 


2028-01-16 


L33710 


Root 


3.32 


R053801 


2028-01-16 


L33710 


Leaf 


16.19 


R053801 


2028-01-16 


L33710 


Root 


7.80 


R053824 


2027-01-1 1 


L33710 


Leaf 


6.93 


R053824 


2027-01-1 1 


L33710 


Root 


10.35 


R053838 


2030-08-12 


L33710 


Leaf 


14.32 


R053838 


2030-08-12 


L33710 


Root 


5.64 


R053857 


2030-08-08 


L33710 


Leaf 


12.70 


R053857 


2030-08-08 


L33710 


Root 


3.97 


R053858 


2028-02-32 


L33710 


Leaf 


2.33 


R053858 


2028-02-32 


L33710 


Root 


4.15 


R053859 


2028-02-32 


L33710 


Leaf 


9.39 


R053859 


2028-02-32 


L33710 


Root 


5.76 


R053904 


2027-02-03 


L33709 


Leaf 


226.05 


R053904 


2027-02-03 


L33709 


Root 


1.55 


R053923 


2029-01-08 


L33710 


Leaf 


12.16 


R053923 


2029-01-08 


L33710 


Root 


11.77 


R053924 


2029-01-08 


L33710 


Leaf 


10.74 


R053924 


2029-01-08 


L33710 


Root 


7.94 


R053928 


2029-01-05 


L33710 


Leaf 


14.86 


R053928 


2029-01-05 


L33710 


Root 


3.84 


R053929 


2029-01-05 


L33710 


Leaf 


15.04 


R053929 


2029-01-05 


L33710 


Root 


3.49 
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All but one pMON33709 line (Ro53643) produced between 3 to 15 times more 
insecticidal protein in the leaves than in the root tissue. The one line that produced less in the 
leaves also produced less than 1 ppm in the root, whereas the other lines produced up to almost 
1 00 ppm in the leaves. The amount of Cry3Bb variant protein expressed was even more variable 
in the non-targeted lines derived from pMON33710 transformation events which were 
determined to be expressing the recombinant protein in both leaf and root tissues. While most of 
these lines produced more protein in the leaves than in the roots, some also produced more in the 
roots, but the difference between the amount produced in the roots in those improved root- 
expressors was less substantial than in the single pMON33709 targeted event. Also, the range of 
expression levels was less pronounced in the non-targeted events with one exception. 
Surprisingly, one line (Ro53904) produced substantially more protein in the leaves than was 
observed in any other line, targeted or non-targeted. This line would be expected to be a 
candidate for a commercial line directed to protection against Coleopteran pests which feed on 
leaf tissues. Conversely, lines such as Ro53923 would be expected to be optimum candidates for 
protecting corn plants against root-feeding pests such as corn rootworms. 

The data in summary indicates that targeting the Bt Cry3B protein to the plastid or 
chloroplast improves the accumulation of the protein in leaf tissue but not in root tissue, and 
improves the overall expression of the protein in leaves in plants transformed with such 
constructs as compared to the levels of expression observed in root tissues in those same plants. 

In view of the above, it will be seen that the several advantages of the invention are achieved 
and other advantageous results attained. As various changes could be made in the above 
methods and compositions without departing from the scope of the invention, it is intended that 
all matter contained in the above description and shown in the accompanying drawings shall be 
interpreted as illustrative and not in a limiting sense. 

In addition, all references referred to in this application are herein incorporated by reference in 
their entirety. 
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Claims: 

1 . A plant comprising a polynucleotide sequence comprising an expression cassette 
containing a linear arrangement of genetic sequences which function together in plant cells to 
achieve improved expression of at least an insecticidal portion of a protein or amino acid 

5 sequence variant thereof from a nucleic acid coding sequence in said plant cells, said protein or 
variant being derived from a Bacillus thuringiensis Cry3 5-endotoxin, said variant exhibiting an 
insecticidal activity toxic to a Coleopteran insect pest which is at least equivalent to said Cry 3, 
wherein said genetic sequences comprise a promoter operably linked in linear sequence to an 
untranslated leader, an intron, said nucleic acid coding sequence, and a transcription termination 

10 and polyadenylation sequence. 

2. The plant according to claim 1 wherein said endotoxin protein or amino acid 
sequence variant thereof is a Cry3B protein or variant thereof, said variant exhibiting an 
insecticidal activity at least equivalent to said Cry3B. 

15 

3. The plant according to claim 2 wherein said endotoxin protein or amino acid 
sequence variant thereof is a Cry3Bb protein or variant thereof, said variant exhibiting an 
insecticidal activity at least equivalent to said Cry3Bb. 

20 4. The plant according to claim 3 wherein said endotoxin protein or amino acid 

sequence variant thereof is a Cry3Bb2 protein or variant thereof, said variant exhibiting an 
insecticidal activity at least equivalent to said Cry3Bb2. 

5. The plant according to claim 1 wherein said endotoxin protein or amino acid 
25 sequence variant thereof is a protein as set forth in SEQID NO:2 or SEQID NO:4 or a variant 

thereof, said variant exhibiting an insecticidal activity at least equivalent to the protein as set 
forth in SEQ ID NO:2. 

6. The plant according to claim 5 wherein said endotoxin protein or amino acid 
30 sequence variant thereof is a protein or a variant thereof selected from the group consisting of 

SEQID NO:4, SEQID NO:8, SEQID NO:10, and SEQID NO: 12, said variant exhibiting an 
insecticidal activity at least equivalent to the protein as set forth in SEQID NO:4. 
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7. The plant according to claim 1, wherein said genetic sequence comprising a 

promoter is selected from the group consisting of CaMV-35S-AS4 promoter, CaMV-e35S 
promoter, and CaMV-35S promoter, and POX promoter. 

5 8. The plant according to claim 1, wherein said genetic sequence comprising a 

promoter is selected from the group consisting of SEQID NO:30 and SEQID NO:29. 

9. The plant according to claim 1 wherein said genetic sequence comprising an 
untranslated leader is the wheat chlorophyll ab binding protein untranslated leader. 

10 

10. The plant according to claim 1 wherein said genetic sequence comprising an 
untranslated leader is SEQID NO:31. 

11. The plant according to claim 1 wherein said genetic sequence comprising an 
15 intron is selected from the group consisting of rice actin intron and HSP70 intron. 

12. The plant according to claim 1 wherein said genetic sequence comprising an 
intron is selected from the group consisting of SEQID NO:32 and SEQID NO:33. 

20 13. The plant according to claim 1 wherein said genetic sequence comprising a 

transcription termination and polyadenylation sequence is selected from the group consisting of 
Agrobacterium tumefaciens nopaline synthase transcription termination and polyadenylation 
sequence and the wheat hspl7 transcription termination and polyadenylation sequence element. 

25 14. The plant according to claim 1 wherein said genetic sequence comprising a 

transcription termination and polyadenylation sequence is selected from the group consisting of 
SEQID NO:34 and SEQID NO:35. 

15. The plant according to claim 1 wherein said genetic sequence comprising said 
30 nucleic acid coding sequence is selected from the group consisting of SEQID NO:l, SEQID 

NO:3, SEQID NO:7, SEQID NO:9, and SEQID NO:l 1. 

16. The plant according to claim 1 wherein said expression cassette is selected from 
the group consisting of SEQID NO:15, SEQID NO:38, SEQID NO:19, SEQID NO:21, and 

35 SEQID NO:23. 
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17. The plant according to claim 1 selected from the group consisting of a 
monocotyledonous plant and a dicotyledonous plant. 

1 8. The plant according to claim 1 7 which is a monocotyledonous plant. 

19. The plant according to claim 1 or a progeny of said plant, wherein said plant or 
said progeny comprise said polynucleotide sequence. 

20. A seed from the plant or progeny of claim 19. 

21 . A plant germinated from the seed of claim 20. 

22. A plant comprising a polynucleotide sequence comprising an expression cassette 



containing a linear arrangement of genetic sequences which function together in plant cells to 
achieve improved expression of a Bacillus thuringiensis Cry3 insecticidal 5-endotoxin protein or 
amino acid sequence variant thereof toxic to a Coleopteran insect pest from a nucleic acid 
coding sequence in said plant cells, said variant exhibiting an insecticidal activity at least 
equivalent to said Cry 3, wherein said genetic sequences comprise a promoter operably linked in 
linear sequence to an untranslated leader, an intron, a nucleotide sequence encoding a plastid or 
chloroplast targeting peptide linked in frame and adjacent to said nucleic acid coding sequence, 
and a transcription termination and polyadenylation sequence. 

23. The plant of claim 17 wherein said genetic sequence comprising a nucleotide 
sequence encoding a plastid or chloroplast targeting peptide is a Zea mays ribulose bis 
phosphate carboxylase synthase small subunit chloroplast targeting sequence. 

24. The plant of claim 17 wherein said genetic sequence comprising a nucleotide 
sequence encoding a plastid or chloroplast targeting peptide is SEQID NO:25. 

25. A plant comprising a polynucleotide sequence comprising an expression cassette 
containing a linear arrangement of genetic sequences which function together in plant cells to 
achieve expression from a nucleic acid sequence encoding an insecticidal Bacillus thuringiensis 
Cry3 5-endotoxin protein or amino acid sequence variant thereof toxic to a Coleopteran insect 
pest feeding on said plant, said variant exhibiting an insecticidal activity at least equivalent to 
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said Cry3, said genetic sequences comprising a promoter operably linked in linear sequence to 
an untranslated leader, an intron, said nucleic acid coding sequence, and a transcription 
termination and polyadenylation sequence, wherein said expression of said protein or variant in 
plant cells of said plant is improved to levels of protein or variant sufficient to protect said plant 
from Coleopteran pest infestation and sufficient to delay the onset of insect resistance to said 
protein or variant. 

26. The plant of claim 20 wherein said improved expression provides levels of 
protein or variant in plant cells from about 200 to about 500 parts per million of total plant 
protein. 

27. The plant of claim 20, wherein said improved expression provides levels of 
protein or variant in plant cells from about 100 to about 200 parts per million of total plant 
protein. 

28. The plant of claim 20, wherein said improved expression provides levels of 
protein or variant in plant cells from about 50 to about 100 parts per million of total plant 
protein. 

29. The plant of claim 20, wherein said improved expression provides levels of 
protein or variant in plant cells from about 10 to about 50 parts per million of total plant protein. 

30. The plant of claim 20, wherein said improved expression provides levels of 
protein or variant in plant cells from about 5 to about 10 parts per million of total plant protein. 

31. The plant of claim 20, wherein the percentage of plants obtained exhibiting said 
improved expression providing levels of protein or variant in said plant cells which are greater 
than about 200 parts per million of total cell protein is from about 0.5 to about 5 percent of all 
plants obtained after transformation and selection of events using said expression cassette to 
produce transgenic events. 

32. The plant of claim 20, wherein the percentage of plants obtained exhibiting said 
improved expression providing levels of protein or variant in said plant cells which are greater 
than about 100 parts per million of total cell protein is from about 0.5 to about 6 percent of all 
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plants obtained after transformation and selection using said expression cassette to produce 
transgenic events. 

33. The plant of claim 20, wherein the percentage of plants obtained exhibiting said 
improved expression providing levels of protein or variant in said plant cells which are greater 
than about 50 parts per million of total cell protein is from about 0.5 to about 8 percent of all 
plants obtained after transformation and selection using said expression cassette to produce 
transgenic events. 

34. The plant of claim 20, wherein the percentage of plants obtained exhibiting said 
improved expression providing levels of protein or variant in said plant cells which are greater 
than about 10 parts per million of total cell protein is from about 0.5 to about 18 percent of all 
plants obtained after transformation and selection using said expression cassette to produce 
transgenic events. 

35. A method for producing a transgenic plant comprising the steps of: 

a) introducing into the DNA of a plant cell or plant tissue a 
polynucleotide sequence comprising an expression cassette containing a linear 
arrangement of genetic sequences which function together in plant cells to 
achieve expression from a nucleic acid sequence encoding an insecticidal Bacillus 
thuringiensis Cry3 5-endotoxin protein or amino acid sequence variant thereof 
toxic to a Coleopteran insect pest feeding on said plant, said variant exhibiting an 
insecticidal activity at least equivalent to said Cry3, said genetic sequences 
comprising a promoter operably linked in linear sequence to an untranslated 
leader, an intron, said nucleic acid coding sequence, and a transcription 
termination and polyadenylation sequence; 

b) growing said plant cell or plant tissue in selective media to obtain a 
transgenic plant containing said polynucleotide sequence heritably incorporated 
into said DNA; and 

c) selecting a plant expressing said protein or variant; 
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wherein said selected plant exhibits improved expression of said protein or 
variant in said transgenic plant, said improved expression being improved to 
levels of protein or variant sufficient to protect said plant from Coleopteran pest 
infestation and sufficient to delay the onset of insect resistance to said protein or 
variant when produced in said transgenic plant. 

36. The method according to claim 35 wherein said step introducing said 
polynucleotide into the DNA of said plant cell or tissue is accomplished by nucleic acid coated 
ballistic particle bombardment, electroporation, or transformation competent Agrobacterium 
tumefaciens transformed with a plasmid containing said polynucleotide sequence. 

37. The method according to claim 36 wherein said transgenic plant is a monocot or a 
dicot. 

38. The method of claim 37 wherein said transgenic plant is a monocot. 

39. The method of claim 38 wherein said monocot is selected from the group 
consisting of corn, wheat, barley, rice, oats, grasses, and bananas. 

40. The method according to claim 35 wherein said selecting step is accomplished 
using a polynucleotide sequence comprising an expression cassette containing a promoter which 
functions in plants operably linked to nucleotide sequence encoding a selectable marker protein 
linked to a plant operable 3' transcription termination and polyadenylation sequence. 

41. The method according to claim 40 wherein the selectable marker protein is 
selected from the group consisting of herbicide tolerance proteins, antibiotic resistance proteins, 
proteins which catalyze substrates to generate visually observed colored products, and proteins 
which catalyze substrates to generate fluorescent or luminescent products. 

The method according to claim 40 wherein the selectable marker protein is selected from the 
group consisting of Nptll, GUS, LUX, Hyg, esterase, PhnO, EPSPS, and GOX. 

42. A vector comprising a polynucleotide sequence as set forth in claim 1 for use in 
transforming a plant, plant cell, or plant tissue, wherein said vector is selected from the group 
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consisting of a plasmid, a bacmid, an artificial chromosome, a linear single or double stranded 
DNA or RNA fragment, and a virus genome. 

43. An isolated and purified polynucleotide sequence comprising an expression 
cassette containing a linear arrangement of genetic sequences which function together in plant 
cells to achieve improved expression of at least an insecticidal portion of a protein or amino acid 
sequence variant thereof from a nucleic acid coding sequence in said plant cells, said protein or 
variant being derived from a Bacillus thuringiensis Cry3 5-endotoxin, said variant exhibiting an 
insecticidal activity toxic to a Coleopteran insect pest which is at least equivalent to said Cry3, 
wherein said genetic sequences comprise a promoter operably linked in linear sequence to an 
untranslated leader, an intron, said nucleic acid coding sequence, and a transcription termination 
and polyadenylation sequence, and wherein said expression cassette improves the expression of 
the protein or variant in a transgenic plant and, improves the number of transgenic events 
observed to express the protein or variant above a threshold level when said polynucleotide 
sequence is used for generating transformed plants. 

44. An isolated and purified polynucleotide sequence comprising one or more 
expression cassettes, each cassette comprising genetic sequence elements which function in 
plant cells to express a desired Bacillus thuringiensis 5-endotoxin insecticidal protein, chimera, 
fusion, or variant thereof from a nucleic acid coding sequence, wherein said coding sequence is 
linked upstream to a promoter sequence element, an untranslated leader sequence element, an 
intron sequence element, and said coding sequence is linked downstream to a transcription 
termination and polyadenylation sequence element, wherein said expression cassette improves 
the expression of the protein or variant in a transgenic plant and improves the percentage of 
transgenic events observed to express the desired protein above a threshold level when using 
said polynucleotide sequence for generating transformed plants. 

45. A method for controlling Coleopteran insect infestation in a field of crop plants 
comprising providing a transgenic plant on which said Coleopteran insect feeds, said transgenic 
plant comprising a polynucleotide sequence comprising an expression cassette containing a 
linear arrangement of genetic sequences which function together in plant cells to achieve 
improved expression of at least an insecticidal portion of a protein or amino acid sequence 
variant thereof from a nucleic acid coding sequence in said plant cells, said protein or variant 
being derived from a Bacillus thuringiensis Cry3 5-endotoxin, said variant exhibiting an 
insecticidal activity toxic to a Coleopteran insect pest which is at least equivalent to said Cry3, 
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wherein said genetic sequences comprise a promoter operably linked in linear sequence to an 
untranslated leader, an intron, said nucleic acid coding sequence, and a transcription termination 
and polyadenylation sequence. 

5 46. The method according to claim 45 wherein the genetic sequences which function 

together in said transgenic plant improves the expression of the protein or variant in a transgenic 
plant and improves the percentage of transgenic events observed to express the desired protein 
above a threshold level when using said polynucleotide sequence for generating transformed 
plants. 
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Figure 2 
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Figure 3 
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<110> Romano, Charles P. 

<120> Improved Expression of Cry3Bb in Corn 
<130> 38-21(15304) Cry3Bb Improved Exp. Corn 



<160> 43 

<170> Patentln Ver. 2.0 



<211> 1959 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 

<221> CDS 

<222> (1) . . (1956) 



<220> 

<223> Description of Artificial Sequence: naturally 

occurring nucleotide sequence encoding a Cry3Bbl 
amino acid sequence 

<400> 1 

atg aat cca aac aat cga agt gaa cat gat acg ata aag gtt aca cct 
Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr Pro 



aac agt gaa ttg caa act aac cat aat caa tat cct tta get gac aat 
Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn 



act gaa gac agt tct acg gaa gtg eta gac aac tct aca gta aaa gat 
Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp 



gca gtt ggg aca gga att tct gtt gta ggg cag att tta ggt gtt gta 
Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val Val 



gga gtt cca ttt get ggg gca etc act tea ttt tat caa tea ttt ctt 
Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 



i tgg aag get ttt atg gca 
) Trp Lys Ala Phe Met Ala 



caa gtt gaa gta ctg ata gat aag aaa ata gag gag tat get aaa agt 
Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser 
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aaa got ctt gca gag tta cag ggt ctt caa aat aat ttc gaa gat tat 432 
Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr 
130 135 140 

gtt aat gcg tta aat tec tgg aag aaa aca cct tta agt ttg cga agt 480 
Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 
145 150 155 160 

aaa aga age caa gat cga ata agg gaa ctt ttt tct caa gca gaa agt 528 
Lys Arg Ser Gin Asp Arg lie Arg Glu Leu Phe Ser Gin Ala Glu Ser 
165 170 175 

cat ttt cgt aat tec atg ccg tea ttt gca gtt tec aaa ttc gaa gtg 576 
His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val 
180 185 190 

ctg ttt eta cca aca tat gca caa get gca aat aca cat tta ttg eta 624 
Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu 
195 200 205 

tta aaa gat get caa gtt ttt gga gaa gaa tgg gga tat tct tea gaa 672 
Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu 
210 215 220 

gat gtt get gaa ttt tat cat aga caa tta aaa ctt aca caa caa tac 720 
Asp Val Ala Glu Phe Tyr His Arg Gin Leu Lys Leu Thr Gin Gin Tyr 
225 230 235 240 



ggt tea act tat gat gca tgg gtc aaa ttt aac cgt ttt cgc aga gaa 
Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu 
260 265 270 

atg act tta act gta tta gat eta att gta ctt ttc cca ttt tat gat 
Met Thr Leu Thr Val Leu Asp Leu lie Val Leu Phe Pro Phe Tyr Asp 
275 280 285 

att egg tta tac tea aaa ggg gtt aaa aca gaa eta aca aga gac att 
lie Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp lie 
290 295 300 

ttt acg gat cca att ttt tea ctt aat act ctt cag gag tat gga cca 
Phe Thr Asp Pro lie Phe Ser Leu Asn Thr Leu Gin Glu Tyr Gly Pro 
305 310 315 320 

act ttt ttg agt ata gaa aac tct att cga aaa cct cat tta ttt gat 
Thr Phe Leu Ser lie Glu Asn Ser lie Arg Lys Pro His Leu Phe Asp 
325 330 335 

tat tta cag ggg att gaa ttt cat acg cgt ctt caa cct ggt tac ttt 
Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr Phe 



: agt ata gga tct agt aag aca att act t 
> Ser He Gly Ser Ser Lys Thr He Thr £ 
370 375 
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aaa tct act gaa cct gta caa aag eta age ttt gat gga caa aaa gtt 120 0 
Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val 
385 390 395 400 

tat cga act ata get aat aca gac gta gcg get tgg ccg aat ggt aag 124 8 
Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 
405 410 415 

gta tat tta ggt gtt acg aaa gtt gat ttt agt caa tat gat gat caa 1296 
Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin 
420 425 430 

aaa aat gaa act agt aea caa aca tat gat tea aaa aga aac aat gge 1344 
Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn Gly 
435 440 445 

cat gta agt gca cag gat tot att gac caa tta ccg cca gaa aca aca 1392 
His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr 



gat gaa cca ctt gaa aaa gca tat agt cat cag ctt aat tac gcg gaa 
Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 



tgt ttc tta atg cag gac cgt cgt gga aca att cca ttt ttt act tgg 1488 
Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr Trp 
485 490 495 



act caa ctt cca gta gtg aaa gca tat gec ttg tct tea ggt get tec 1584 
Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser 



att att gaa ggt cca gga ttc aca gga gga aat tta eta ttc eta aaa 1632 
He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys 
530 535 540 

gaa tct agt aat tea att get aaa ttt aaa gtt aca tta aat tea gca 1580 
Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala 
545 550 555 560 

gee ttg tta caa cga tat cgt gta aga ata cgc tat get tct ace act 1728 
Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 

aac tta cga ctt ttt gtg caa aat tea aac aat gat ttt ctt gtc ate 1776 
Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val He 
580 585 590 

tac att aat aaa act atg aat aaa gat gat gat tta aca tat caa aca 1824 
Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin Thr 
595 600 605 

ttt gat etc gca act act aat tct aat atg ggg ttc teg ggt gat aag 1872 
Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Lys 
610 615 620 

aat gaa ctt ata ata gga gca gaa tct ttc gtt tct aat gaa aaa ate 1920 
Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He 
625 630 635 640 
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tat ata gat aag ata gaa ttt ate cca gta 
Tyr He Asp Lys He Glu Phe He Pro Val 
545 650 



Gin Leu 



1959 



<210> 2 
<211> 652 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 2 

Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr Pro 
15 10 15 

Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn 
20 25 30 

Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met 
35 40 45 

Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp 
50 55 60 

Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val Val 
65 70 75 80 

Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 
85 90 95 

Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala 
100 105 110 

Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser 
115 120 125 

Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr 
130 135 140 

Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 
145 150 155 160 

Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
165 170 175 

His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val 
180 185 190 

Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu 
195 200 205 

Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu 
210 215 220 

Asp Val Ala Glu Phe Tyr His Arg Gin Leu Lys Leu Thr Gin Gin Tyr 
225 230 235 240 

Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg 
245 250 255 

Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu 
260 265 270 

Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr Asp 
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275 280 285 

lie Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp lie 
290 295 300 

Phe Thr Asp Pro lie Phe Ser Leu Asn Thr Leu Gin Glu Tyr Gly Pro 



Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg 
355 360 355 



Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val 
385 390 395 400 

Tyr Arg Thr lie Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 



Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 
465 470 475 480 



Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys He 



Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 
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Asn Glu Leu lie lie Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He 
625 S30 635 640 

Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 3 
<211> 1959 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 

<221> CDS 

<222> (1) . . (1956) 

<223> naturally occurring nucleotide sequence encoding a 
Cry3Bb2 amino acid sequence 

<400> 3 

atg aat cca aac aat cga agt gaa cat gat acg ata aag gtt aca cct 48 
Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr Pro 
15 10 15 

aac agt gaa ttg cca act aac cat aat caa tat cct tta get gac aat 96 
Asn Ser Glu Leu Pro Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn 
20 25 30 

cca aat teg aca eta gaa gaa tta aat tat aaa gaa ttt tta aga atg 144 
Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met 
35 40 45 

act gaa gac agt tct acg gaa gtg eta gac aac tct aca gta aaa gat 192 
Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp 
50 55 60 

gca gtt ggg aca gga att tct gtt gta ggg cag att tta ggt gtt gta 24 0 
Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val Val 
65 70 75 80 

gga gtt cca ttt get ggg gca etc act tea ttt tat caa tea ttt ctt 288 
Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 
85 90 95 

gac act ata tgg eca agt gat get gac cca tgg aag get ttt atg gca 336 
Asp Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala 
100 105 110 

caa gtt gaa gta ctg ata gat aag aaa ata gag gag tat get aaa agt 3 84 
Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser 
115 120 125 

aaa get ctt gca gag tta cag ggt ctt caa aat aat ttc gaa gat tat 432 
Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr 
130 135 140 

gtt aat gcg tta aat tec tgg aag aaa aca cct tta agt ttg cga agt 480 
Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 
145 150 155 160 

aaa aga age caa gat cga ata agg gaa ctt ttt tct caa gca gaa agt 528 
Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
165 170 175 
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cat ttt cgt aat tec atg ccg tea ttt gca gtt tec aaa ttc gaa gtg 576 

His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val 

180 185 190 

ctg ttt eta cca aca tat gca caa get gca aat aca cat tta ttg eta 624 
Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu 
195 200 205 

tta aaa gat get caa gtt ttt gga gaa gaa tgg gga tat tct tea gaa 672 
Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu 
210 215 220 

gat gtt get gaa ttt tat cat aga caa tta aaa ctt acg caa caa tac 720 
Asp Val Ala Glu Phe Tyr His Arg Gin Leu Lys Leu Thr Gin Gin Tyr 
225 230 235 240 

act gac cat tgt gtc aat tgg tat aat gtt gga tta aat ggt tta aga 76 8 
Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg 
245 250 255 

ggt tea act tat gat gca tgg gtc aaa ttt aac cgt ttt cgc aga gaa 816 
Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu 
260 265 270 

atg act tta act gta tta gat eta att gta ctt ttc cca ttt tat gat 864 
Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr Asp 
275 280 285 

gtt egg tta tac tea aaa ggt gtt aaa aca gaa eta aca aga gac att 912 
Val Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp He 
290 295 300 

ttt acg gat cca att ttt tea etc aat act ctt cag gag tat gga cca 960 
Phe Thr Asp Pro He Phe Ser Leu Asn Thr Leu Gin Glu Tyr Gly Pro 
305 310 315 320 

act ttt ttg agt ata gaa aac tct att cga aaa cct cat tta ttt gat 1008 
Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe Asp 
325 330 335 

tat tta cag ggt att gaa ttt cat acg cgt ctt caa cct ggt tac tct 1056 
Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr Ser 
340 345 350 

ggg aaa gat tct ttc aat tat tgg tct ggt aat tat gta gaa act aga 1104 
Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg 
355 360 365 

cct agt ata gga tct agt aag aca att act tec cca ttt tat gga gat 1152 
Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp 
370 375 380 

aaa tct act gaa cct gta caa aag tta age ttt gat gga caa aaa gtt 1200 
Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val 
385 390 395 400 

tat cga act ata get aat aca gac gta gcg get tgg ccg aat ggc aag 1248 
Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 
405 410 415 

ata tat ttt ggt gtt acg aaa gtt gat ttt agt caa tat gat gat caa 1296 
He Tyr Phe Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin 
420 425 430 
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aaa aat gaa act agt aca caa aca tat gat tea aaa aga aac aat ggc 13' 
Lys Asn Glu Thr 



cat gta ggt gca cag gat tct att gac caa tta cca cca gaa aca aca 13 92 
His Val Gly Ala Gin Asp Ser lie Asp Gin Leu Pro Pro Glu Thr Thr 
450 455 460 



gat gaa cca ctt gaa aaa gca tat agt cat cag ctt aat tac gcg gaa 1440 
Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 
4S5 470 475 480 



tgt ttc tta atg cag gac cgt cgt gga aca att cca ttt ttt act tgg 1488 
Cys Phe Leu Met Gin Asp Arg Arg Gly Thr lie Pro Phe Phe Thr Trp 
485 490 495 



aca cat aga agt gta gac ttt ttt aat aca att gat get gaa aag att 1536 
Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys He 
500 505 510 



act caa ctt cca gta gtg aaa gca tat gec ttg tct tea ggt get tec 1584 
Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser 
515 520 525 



att att gaa ggt cca gga ttc aca gga gga aat tta eta ttc eta aaa 1632 
He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys 
530 535 540 



gaa tct agt aat tea att get aaa ttt aaa gtt aca tta aat tea gca 1680 
Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala 
545 550 555 560 



gee ttg tta caa cga tat cgt gta aga ata cgc tat get tct ace act 1728 
Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 



aac tta cga ctt ttt gtg caa aat tea aac aat gat ttt att gte ate 1776 
Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe He Val He 
580 585 590 



tac att aat aaa act atg aat ata gat gat gat tta aca tat caa aca 1824 
Tyr He Asn Lys Thr Met Asn He Asp Asp Asp Leu Thr Tyr Gin Thr 
595 600 605 



ttt gat etc gca act act aat tct aat atg ggg ttc teg ggt gat acg 1872 
Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Thr 
610 615 620 



aat gaa ctt ata ata gga gca gaa tct ttc gtt tct aat gaa aaa ate 1920 
Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He 
625 630 635 640 



tat ata gat aag ata gaa ttt ate cca gta caa ttg taa 1959 
Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 4 
<211> 652 
<212> PRT 

<213> Bacillus thuringiensis 



Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr Pro 
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5 



15 



Asn Ser Glu Leu Pro Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn 
20 25 30 

Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met 
35 40 45 

Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp 
50 55 SO 

Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val Val 
65 70 75 80 

Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 
85 90 95 

Asp Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala 
100 105 110 

Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser 
115 120 125 

Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr 
130 135 140 

Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 
145 150 155 160 

Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
165 170 175 

His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val 
180 185 190 

Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu 
195 200 205 

Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu 
210 215 220 

Asp Val Ala Glu Phe Tyr His Arg Gin Leu Lys Leu Thr Gin Gin Tyr 
225 230 235 240 

Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg 
245 250 255 

Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu 
260 265 270 

Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr Asp 
275 280 285 

Val Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp He 
290 295 300 

Phe Thr Asp Pro He Phe Ser Leu Asn Thr Leu Gin Glu Tyr Gly Pro 
305 310 315 320 

Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe Asp 
325 330 335 

Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr Ser 



340 



345 



350 
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Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg 
355 360 365 

Pro Ser lie Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp 
370 375 380 

Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val 
385 390 395 400 

Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 
405 410 415 

He Tyr Phe Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin 
420 425 430 

Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn Gly 
435 440 445 

His Val Gly Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr 
450 455 460 

Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 
4S5 470 475 480 

Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr Trp 
485 490 495 

Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys He 
500 505 510 

Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser 
515 520 525 

He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys 
530 535 540 

Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala 
545 550 555 560 

Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 

Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe He Val He 
580 585 590 

Tyr He Asn Lys Thr Met Asn He Asp Asp Asp Leu Thr Tyr Gin Thr 
595 600 605 

Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Thr 
610 615 620 

Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He 
625 630 635 640 

Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 5 
<211> 1962 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: synthetic 
non-naturally occurring nucleotide sequence 
encoding a Cry3Bb amino acid sequence 



atg aac cct aac aat cgt tec gaa cac gac acc ate aag gtt act cca 
Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr Pro 



ggt gtt cca ttc get gga get ttg acc age ttc tac cag tec ttt etc 
Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 



aac acc ate tgg cct tea gat get gat ccc tgg aag get ttc atg gec 
Asn Thr lie Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala 
100 105 110 

caa gtg gaa gtc ttg ate gat aag aag ate gaa gag tat gee aag tct 
Gin Val Glu Val Leu lie Asp Lys Lys lie Glu Glu Tyr Ala Lys Ser 
115 120 125 

aaa gec ttg get gag ttg caa ggt ttg cag aac aac ttc gag gat tac 
Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr 



gtc aac gca etc aac age tgg aag aaa act ccc ttg agt etc agg tct 
Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 



aag cgt tec cag gac cgt att cgt gaa ctt ttc age caa gee gaa tec 
Lys Arg Ser Gin Asp Arg lie Arg Glu Leu Phe Ser Gin Ala Glu Ser 



cac ttc aga aac tec atg cct age ttt gee gtt tct aag ttc gag gtg 
His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val 



etc ttc ttg cca aca tac gca caa get gec aac act cat etc ttg ctt 
Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu 
195 200 205 

etc aaa gac get cag gtg ttt ggt gag gaa tgg ggt tac tec agt gaa 
Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu 
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210 215 220 

gat gtt gcc gag ttc tac cat agg cag etc aag ttg act caa cag tac 720 
Asp Val Ala Glu Phe Tyr His Arg Gin Leu Lys Leu Thr Gin Gin Tyr 
225 230 235 240 

aca gac cac tgc gtc aac tgg tac aac gtt ggg etc aat ggt ctt aga 768 
Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg 
245 250 255 

gga tct acc tac gac gca tgg gtg aag ttc aac agg ttt cgt aga gag 815 
Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu 
260 265 270 

atg acc ttg act gtg etc gat ctt ate gtt etc ttt cca ttc tac gac 864 
Met Thr Leu Thr Val Leu Asp Leu lie Val Leu Phe Pro Phe Tyr Asp 
275 280 285 

att cgt ctt tac tec aaa ggc gtt aag aca gag ctg acc aga gac ate 912 
lie Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp lie 
290 295 300 



act ttt etc tec att gag aac age ate agg aag cct cac etc ttc gac 1008 
Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe Asp 
325 330 335 

tat ctg caa ggc att gag ttt cac acc agg ttg caa cct ggt tac ttc 1056 
Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr Phe 
340 345 350 

ggt aag gat tec ttc aac tac tgg age gga aac tac gtt gaa acc aga 1104 
Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg 
355 360 365 

cca tec ate gga tct age aag acc ate act tct cca ttc tac ggt gac 1152 
Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp 
370 375 380 

aag age act gag cca gtg cag aag ttg age ttc gat ggg cag aag gtg 1200 
Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val 
385 390 395 400 

tat aga acc ate gcc aat acc gat gtt gca get tgg cct aat ggc aag 1248 
Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 



gtc tac ctt gga gtt act aaa gtg gac ttc tec caa tac gac gat cag 1296 
Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin 
420 425 430 

aag aac gag aca tct act caa acc tac gat agt aag agg aac aat ggc 1344 
Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn Gly 



i Pro Pro Glu Thr Thr 



gat gaa cca ttg gag aag get tac agt cac caa ctt aac tac gcc gaa 
Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 
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465 470 475 480 



tgc ttt etc atg caa gac agg cgt ggc acc att ccg ttc ttt aca tgg 1488 
Cys Phe Leu Met Gin Asp Arg Arg Gly Thr lie Pro Phe Phe Thr Trp 
485 490 495 

act cac agg tct gtc gac ttc ttt aac act ate gac get gag aag att 1536 
Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys He 
500 505 510 

acc caa ctt ccc gtg gtc aag get tat gec ttg tec age gga get tec 1584 
Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser 
515 520 525 

ate att gaa ggt cca ggc ttc acc ggt ggc aac ttg etc ttc ctt aag 1632 
He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys 
530 535 540 

gag tec age aac tec ate gee aag ttc aaa gtg aca ctt aac tea gca 1680 
Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala 
545 550 555 550 

gee ttg etc caa cgt tac agg gtt cgt ate aga tac gca age act acc 1728 
Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 

aat ctt cgc etc ttt gtc cag aac age aac aat gat ttc ctt gtc ate 1776 
Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val He 
580 585 590 

tac ate aac aag act atg aac aaa gac gat gac etc acc tac aac aca 1824 
Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Asn Thr 
595 600 605 

ttc gat ctt gec act acc aat agt aac atg gga ttc tct ggt gac aag 1872 
Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Lys 
610 615 620 

aac gag ctg ate ata ggt get gag age ttt gtc tct aat gag aag att 1920 
Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He 
625 630 635 640 

tac ata gac aag ate gag ttc att cca gtt caa etc taatag 1962 
Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 6 
<211> 652 
<212> PRT 

<213> Artificial Sequence 
<400> 6 

Met Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr Pro 
15 10 15 

Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn 
20 25 30 

Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met 
35 40 45 

Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp 
50 55 60 
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Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val Val 



Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu 
85 90 95 



Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser 



Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser 
165 170 175 



He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp He 



Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp 
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Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys 
405 410 415 

Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin 



Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu 



Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala 
545 550 555 560 

Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
565 570 575 

Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val He 



<210> 7 
<211> 1989 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: non-naturally 
occurring nucleotide sequence encoding a variant 
Cry3Bb amino acid sequence V11231 

<220> 

<221> CDS 

<222> (3) . . (1961) 

<223> coding sequence for Cry3Bb variant V11231 amino 
acid sequence 
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act cca aac tct gag ttg caa act aat cac aac cag tac cca ttg get 
Thr Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala 



gac aat cct aac agt act ctt gag gaa ctt aac tac aag gag ttt etc 
Asp Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu 



gtg gtt ggt gtt cca ttc get gga get ttg ace age ttc tac cag tec 
Val Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser 



ttt etc aac ace ate tgg cct tea gat get gat ccc tgg aag get ttc 
Phe Leu Asn Thr lie Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe 
100 105 110 

atg gee caa gtg gaa gtc ttg ate gat aag aag ate gaa gag tat gee 
Met Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala 
115 120 125 

aag tct aaa gee ttg get gag ttg caa ggt ttg cag aac aac ttc gag 
Lys Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu 



agg tct aag cgt tec cag gac cgt att cgt gaa ctt ttc age caa gee 
Arg Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala 
160 165 170 175 



ttg ctt etc aaa gac get cag gtg ttt ggt gag gaa tgg ggt tac tec 
Leu Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser 



cag tac aca gac cac tgc gtc aac tgg tac aac gtt ggg etc aat ggt 
Gin Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly 
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ctt aga gga tct acc tac gac gca tgg gtg aag ttc aac agg ttt cgt 815 
Leu Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg 
260 265 270 

aga gag atg acc ttg act gtg etc gat ctt ate gtt etc ttt cca ttc 863 
Arg Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe 
275 280 285 

tac gac att cgt ctt tac tec aaa ggc gtt aag aca gag ctg acc aga 911 
Tyr Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg 
290 295 300 

gac ate ttc acc gat ccc ate ttc eta ctt acg acc ctg cag aaa tac 959 
Asp He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr 



ggt cca act ttt etc tec att gag aac age ate agg aag cct cac etc 1007 
Gly Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu 
320 325 330 335 

ttc gac tat ctg caa ggc att gag ttt cac acc agg ttg caa cct ggt 1055 
Phe Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly 
340 345 350 

tac ttc ggt aag gat tec ttc aac tac tgg age gga aac tac gtt gaa 1103 
Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu 
355 360 3S5 

acc aga cca tec ate gga tct age aag acc ate act tct cca ttc tac 1151 
Thr Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr 
370 375 380 

ggt gac aag age act gag cca gtg cag aag ttg age ttc gat ggg cag 1199 
Gly Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin 
385 390 395 

aag gtg tat aga acc ate gec aat acc gat gtt gca get tgg cct aat 1247 
Lys Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn 
400 405 410 415 

ggc aag gtc tac ctt gga gtt act aaa gtg gac ttc tec caa tac gac 1295 
Gly Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp 
420 425 430 

gat cag aag aac gag aca tct act caa acc tac gat agt aag agg aac 1343 
Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn 



aat ggc cat gtt tec gca caa gac tec att gac caa ctt cca cct gaa 
Asn Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu 
450 455 460 

acc act gat gaa cca ttg gag aag get tac agt cac caa ctt aac tac 
Thr Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr 



gec gaa tgc ttt etc atg caa gac agg cgt ggc acc att ccg ttc ttt 1487 
Ala Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe 



aca tgg act cac agg tct gtc gac ttc ttt aac act ate gac get gag 1535 
Thr Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu 
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500 505 510 

aag att acc caa ctt ccc gtg gtc aag get tat gec ttg tec age gga 1583 
Lys lie Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly 
515 520 525 

get tec ate att gaa ggt cca ggc ttc acc ggt ggc aac ttg etc ttc 1631 
Ala Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe 
530 535 540 

ctt aag gag tec age aac tec ate gee aag ttc aaa gtg aca ctt aac 1679 
Leu Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn 
545 550 555 

tea gca gee ttg etc caa cgt tac agg gtt cgt ate aga tac gca age 1727 
Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser 
560 565 570 575 

act acc aat ctt cgc etc ttt gtc cag aac age aac aat gat ttc ctt 1775 
Thr Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu 
580 585 590 

gtc ate tac ate aac aag act atg aac aaa gac gat gac etc acc tac 1823 
Val He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr 
595 600 605 

caa aca ttc gat ctt gec act acc aat agt aac atg gga ttc tct ggt 1871 
Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly 
610 615 620 

gac aag aac gag ctg ate ata ggt get gag age ttt gtc tct aat gag 1919 
Asp Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu 
625 630 635 

aag att tac ata gac aag ate gag ttc att cca gtt caa etc 1961 

Lys He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
640 645 650 

taatagatcc cccgggctgc aggaattc 1989 

<210> 8 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 8 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 
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Leu Asn Thr lie Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 



Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 



Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 



Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 

260 265 270 



Asp lie Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 



Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 



Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
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Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 



Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 



He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 



<210> 9 
<211> 1984 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: non-naturally 
occurring nucleotide sequence encoding a Cry3Bb 
variant 1123 lmvl amino acid sequence 

<220> 

<221> CDS 

<222> (3) . . (1961) 

<223> coding sequence for a Cry3Bb variant 11231mvl 
amino acid sequence 

<400> 9 

cc atg gcc aac ccc aac aat cgc tec gag cac gac acg ate aag gtc 47 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val 
15 10 15 
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cgc atg acc gag gac tec tec acg gag gtc ctg gac aac tec acc gtc 
Arg Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val 



aag gac gec gtc ggg acc ggc ate tec gtc gtt ggg cag ate ctg ggc 
Lys Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly 



gtc gtt ggc gtc ccc ttc gca ggt get etc acc tec ttc tac cag tec 
Val Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser 



ttc ctg aac acc ate tgg ccc tec gac gee gac ccc tgg aag gee ttc 
Phe Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe 



aag tec aag gee ctg gee gag ctg caa ggc ctg caa aac aac ttc gag 
Lys Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu 



gae tac gtc aac gcg ctg aac tec tgg aag aag acg cct ctg tec ctg 
Asp Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu 



gag gtc ctg ttc ctg ccc acc tac gee cag get gee aac acc cac etc 
Glu Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu 
195 200 205 

ctg ttg ctg aag gac gee cag gtc ttc ggc gag gaa tgg ggc tac tee 
Leu Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser 
210 215 220 

teg gag gac gtc gec gag ttc tac cgt cgc cag ctg aag ctg acc caa 
Ser Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin 
225 230 235 

cag tac acc gac cac tgc gtc aac tgg tac aac gtc ggc ctg aac ggc 
Gin Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly 



ctg agg ggc tee acc tac gac gca tgg gtc aag ttc aac cgc ttc cgc 
Leu Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg 
260 265 270 



21 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 PCT/US99/18883 

agg gag atg acc ctg acc gtc ctg gac ctg ate gtc ctg ttc ccc ttc 863 

Arg Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe 

275 280 285 

tac gac ate cgc ctg tac tec aag ggc gtc aag acc gag ctg acc cgc 911 
Tyr Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg 
290 295 300 

gac ate ttc acg gac ccc ate ttc ctg etc acg acc etc cag aag tac 959 
Asp He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr 
305 310 315 

ggt ccc acc ttc ctg tec ate gag aac tec ate cgc aag cec cac ctg 1007 
Gly Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu 
320 325 330 335 

ttc gac tac etc cag ggc ate gag ttc cac acg cgc ctg agg cca ggc 1055 
Phe Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly 
340 345 350 

tac ttc ggc aag gac tec ttc aac tac tgg tec ggc aac tac gtc gag 1103 
Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu 
355 360 365 

acc agg ccc tec ate ggc tec teg aag acg ate acc tec cct ttc tac 1151 
Thr Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr 
370 375 380 

ggc gac aag tec acc gag ccc gtc cag aag ctg tec ttc gac ggc cag 1199 
Gly Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin 
385 390 395 

aag gtc tac cgc acc ate gec aac acc gac gtc gcg get tgg ccg aac 1247 
Lys Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn 
400 405 410 415 

ggc aag gtc tac ctg ggc gtc acg aag gtc gac ttc tec cag tac gat 1295 
Gly Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp 
420 425 430 

gac cag aag aat gaa acc tec acc cag acc tac gac tec aag cgc aac 1343 
Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn 
435 440 445 



acc act gac gag ccc ctg gag aag gec tac tec cac cag ctg aac tac 
Thr Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr 
465 470 475 

gcg gag tgc ttc ctg atg caa gac cgc agg ggc acc ate ccc ttc ttc 
Ala Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe 
480 485 490 495 

acc tgg acc cac ego tec gtc gac ttc ttc aac acc ate gac gee gag 
Thr Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu 
500 505 510 
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gcc tec ate att gag ggt cca ggc ttc acc ggt ggc aac ctg ctg ttc 1631 

Ala Ser lie lie Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe 

530 535 540 

ctg aag gag tec teg aae tec ate gcc aag ttc aag gtc acc ctg aac 1679 
Leu Lys Glu Ser Ser Asn Ser lie Ala Lys Phe Lys Val Thr Leu Asn 
545 550 555 

tec get gcc ttg ctg caa cgc tac cgc gtc cgc ate cgc tac gcc tec 1727 
Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg lie Arg Tyr Ala Ser 
560 565 570 575 

acc acg aac ctg cgc ctg ttc gtc cag aac tec aac aat gac ttc ctg 1775 
Thr Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu 
580 585 590 

gtc ate tac ate aac aag aec atg aac aag gac gat gac ctg acc tac 1823 
Val lie Tyr lie Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr 
595 600 605 

cag acc ttc gac etc gcc acc acg aac tec aac atg ggc ttc teg ggc 1871 
Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly 
610 615 620 

gac aag aat gaa ctg ate att ggt get gag tec ttc gtc tec aat gaa 1919 
Asp Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu 
625 630 635 

aag ate tae ate gac aag ate gag ttc ate ccc gtc cag ctg 1961 
Lys He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
640 645 650 

tgataggaac tctgattgaa ttc 1984 



<210> 10 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 
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Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 
145 150 155 160 

Ser Lys Arg Ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu 
165 170 175 

Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 

Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 

Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe 
325 330 335 

Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 
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Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 ■ 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 11 
<211> 1984 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: non-naturally 
occurring nucleotide sequence encoding a Cry3Bb 
variant 11231mv2 amino acid sequence 

<220> 

<221> CDS 

<222> (3) . . (1961) 

<223> coding sequence for a Cry3Bb variant 11231rav2 
amino acid sequence 

<400> 11 

cc atg gcc aac ccc aac aat cgc tec gag cac gac acg ate aag gtc 47 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val 
15 10 15 

acc ccc aac tec gag etc cag acc aac cac aac cag tac ccg ctg gcc 95 
Thr Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala 
20 25 30 

gac aac ccc aac tec acc ctg gaa gag ctg aac tac aag gag ttc ctg 143 
Asp Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu 
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ttc ctg aac acc ate tgg ccc tec gac gec gac ccc tgg aag gec ttc 
Phe Leu Asn Thr lie Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe 
100 105 110 



gac tac gtc aac gcg ctg aac tec tgg aag aag acg cct ctg tec ctg 
Asp Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu 
145 150 155 

cgc tec aag cgc tec cag gac cgc ate cgc gag ctg ttc tec cag gec 
Arg Ser Lys Arg Ser Gin Asp Arg lie Arg Glu Leu Phe Ser Gin Ala 
160 165 170 175 

gag tec cao ttc cgc aac tec atg ccg tec ttc gee gtc tec aag ttc 
Glu Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe 
180 185 190 

gag gtc ctg ttc ctg ccc acc tac gee cag get gec aac acc cac etc 
Glu Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu 
195 200 205 

etg ttg ctg aag gac gee cag gtc ttc ggc gag gaa tgg ggc tac tec 
Leu Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser 
210 215 220 



: tgc gtc aac tgg tac aac gtc ggc ctg aac ggc 
3 Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly 
245 250 255 



agg gag atg acc ctg acc gtc ctg gac ctg ate gtc ctg ttc ccc ttc 
Arg Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe 
275 280 285 
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ttc gac tac etc cag ggc ate gag ttc cac acg cgc ctg agg cca ggc 

Phe Asp Tyr Leu Gin Gly lie Glu Phe His Thr Arg Leu Arg Pro Gly 

340 345 350 

tac ttc ggc aag gac tec ttc aac tac tgg tec ggc aac tac gtc gag 

Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu 

355 360 365 



aag gtc tac cgc acc ate gee aac ace gac gtc gcg get tgg ccg aac 
Lys Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn 
400 405 410 415 

ggc aag gtc tac ctg ggc gtc acg aag gtc gac ttc tec cag tac gat 
Gly Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp 
420 425 430 

gac cag aag aat gaa acc tec acc cag acc tac gac tec aag cgc aac 
Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn 
435 440 445 

aat ggc cac gtc tec gec cag gac tec ate gac cag ctg ccg cct gag 
Asn Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu 
450 455 460 

acc act gac gag ccc ctg gag aag gee tac tec cac cag ctg aac tac 
Thr Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr 
465 470 475 

gcg gag tgc ttc ctg atg caa gac cgc agg ggc acc ate ccc ttc ttc 
Ala Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe 
480 485 490 495 

acc tgg acc cac cgc tec gtc gac ttc ttc aac acc ate gac gee gag 
Thr Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu 
500 505 510 



gee tec ate att gag ggt cca ggc ttc acc ggt ggc aac ctg ctg ttc 
Ala Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe 
530 535 540 
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545 550 555 

tec get gee ttg ctg caa cgc tac cgc gtc cgc ate cgc tac gec tec 1727 
Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg lie Arg Tyr Ala Ser 
560 565 570 575 

aec aeg aac ctg cgc ctg ttc gtc cag aac tec aac aat gac ttc ctg 1775 
Thr Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu 
580 585 590 

gtc ate tac ate aac aag acc atg aac aag gac gat gac ctg acc tac 1823 
Val lie Tyr lie Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr 
595 600 605 

cag acc ttc gac etc gee acc acg aac tec aac atg ggc ttc teg ggc 1871 
Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly 
610 615 620 

gac aag aat gaa ctg ate att ggt get gag tec ttc gtc tec aat gaa 1919 
Asp Lys Asn Glu Leu lie He Gly Ala Glu Ser Phe Val Ser Asn Glu 
625 630 635 

aag ate tac ate gac aag ate gag ttc ate ccc gtc cag ctg 1961 
Lys He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
640 645 650 

tgataggaac tctgattgaa ttc 1984 

<210> 12 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 

35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 ' 95 



Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 



PC17US99/18883 



Ser Lys Arg Ser Gin Asp Arg lie Arg Glu Leu Phe Ser Gin Ala Glu 



Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 



Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 



Asp Tyr Leu Gin Gly lie Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 



Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 



Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 



Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 
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Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr lie Asp Ala Glu Lys 



Lys Glu Ser Ser Asn Ser lie Ala Lys Phe Lys Val Thr Leu Asn Sej 



Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg lie Arg Tyr Ala Ser Thr 
565 570 575 



<210> 13 
<211> 4149 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 



<220> 

<221> promoter 
<222> (25) . . (640) 
<223> P-CaMV.35S 

<220> 

<221> intron 

<222> (669) . . (1472) 

<223> I-Zm.Hsp70 

<220> 

<221> transit_peptide 

<222> (1489) . . (1535) 

<223> amino terminal TS-Zm.rbcS 

<220> 

<221> intron 

<222> (1636) . . (1798) 

<223> I-Zm.rbcS 

<220> 

<221> transit_peptide 

<222> (1799) . . (1885) 

<223> carboxy terminus TS-Zm.rbcS 



30 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 PCT/US99/18883 

<220> 
<221> CDS 

<222> (1885) . . (3843) 

<223> Cry3Bbl variant V11231 

<220> 

<221> terminator 
<222> (3871) . . (4127) 

<223> T-AGRtu . nos 3' transcription termination and 
polyadenylation sequence 

<400> 13 

gcggccgcgt taacaagctt ctgcaggtcc gatgtgagac ttttcaacaa agggtaatat 60 
ccggaaacct cctcggattc cattgcccag ctatctgtca ctttattgtg aagatagtgg 120 
aaaaggaagg tggctcctac aaatgccatc attgcgataa aggaaaggcc atcgttgaag 180 
atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc atcgtggaaa 24 0 
aagaagacgt tccaaccacg tcttcaaagc aagtggattg atgtgatggt ccgatgtgag 300 
acttttcaac aaagggtaat atccggaaac ctcctcggat tccattgccc agctatctgt 360 
cactttattg tgaagatagt gaaaaggaag gtggctccta caaatgccat cattgcgata 420 
aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 480 
ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 54 0 
gatgtgatat ctccactgac gtaagggatg acgcacaatc ccactatcct tcgcaagacc 60 0 
cttcctctat ataaggaagt tcatttcatt tggagaggac acgctgacaa gctgactcta 660 
gcagatctac cgtcttcggt acgcgctcac tccgccctct gcctttgtta ctgccacgtt 720 
tctctgaatg ctctcttgtg tggtgattgc tgagagtggt ttagctggat ctagaattac 780 
actctgaaat cgtgttctgc ctgtgctgat tacttgccgt cctttgtagc agcaaaatat 840 
agggacatgg tagtacgaaa cgaagataga acctacacag caatacgaga aatgtgtaat 900 
ttggtgctta gcggtattta tttaagcaca tgttggtgtt atagggcact tggattcaga 960 
agtttgctgt taatttaggc acaggcttca tactacatgg gtcaatagta tagggattca 1020 
tattataggc gatactataa taatttgttc gtctgcagag cttattattt gccaaaatta 1080 
gatattccta ttctgttttt gtttgtgtgc tgttaaattg ttaacgcctg aaggaataaa 1140 
tataaatgac gaaattttga tgtttatctc tgctccttta ttgtgaccat aagtcaagat 1200 
cagatgcact tgttttaaat attgttgtct gaagaaataa gtactgacag tattttgatg 1260 
cttgatctgc ttgtttgttg taacaaaatt taaaaataaa gagtttcctt tttgttgctc 1320 
tccttacctc ctgatggtat ctagtatcta ccaactgaca ctatattgct tctctttaca 1380 
tacgtatctt gctcgatgcc ttctccctag tgttgaccag tgttactcac atagtctttg 1440 
ctcatttcat tgtaatgcag ataccaagcg gcctctagag gatcagcatg gcgcccaccg 1500 
tgatgatggc ctcgtcggcc accgccgtcg ctccgttcct ggggctcaag tccaccgcca 1560 
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gcctccccgt cgcccgccgc tcctccagaa gcctcggcaa cgtcagcaac ggcggaagga 1620 

tccggtgcat gcaggtaaca aatgcatcct agctagcagt tctttgcatt gcagcagctg 1680 

cagctagcga gttagtaata ggaagggaac tgatgatcca tgcatggact gatgtgtgtt 1740 

gcccatccca tcccatccca tttcccaaac gaaccgaaaa caccgtacta cgtgcaggtg 1800 

tggccctacg gcaacaagaa gttcgagacg ctgtcgtacc tgccgccgct gtcgaccggc 1860 

gggcgcatcc gctgcatgca ggcc atg gca aac cct aac aat cgt tec gaa 1911 
Met Ala Asn Pro Asn Asn Arg Ser Glu 



aac cag tac cca ttg get gac aat cct aac agt act ctt gag gaa ctt 
Asn Gin Tyr Pro Leu Ala Asp Asn Pro Asn Ser Thr Leu Glu Glu Leu 



aac tac aag gag ttt etc egg atg acc gaa gat age tec act gag gtt 2055 
Asn Tyr Lys Glu Phe Leu Arg Met Thr Glu Asp Ser Ser Thr Glu Val 



etc gat aac tct aca gtg aag gac get gtt gga act ggc att age gtt 2103 
Leu Asp Asn Ser Thr Val Lys Asp Ala Val Gly Thr Gly He Ser Val 



gtg gga cag att ctt gga gtg gtt ggt gtt cca ttc get gga get ttg 2151 
Val Gly Gin He Leu Gly Val Val Gly Val Pro Phe Ala Gly Ala Leu 



acc age ttc tac cag tec ttt etc aac acc ate tgg cct tea gat get 
Thr Ser Phe Tyr Gin Ser Phe Leu Asn Thr He Trp Pro Ser Asp Ala 



ttg cag aac aac ttc gag gat tac gtc aac gca etc aac age tgg aag 2343 
Leu Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu Asn Ser Trp Lys 



aaa act ccc ttg agt etc agg tct aag cgt tec cag gac cgt att cgt 2391 
Lys Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin Asp Arg He Arg 
155 160 165 

gaa ctt tte age caa gee gaa tec cac ttc aga aac tec atg cct age 243 9 
Glu Leu Phe Ser Gin Ala Glu Ser His Phe Arg Asn Ser Met Pro Ser 
170 175 180 185 

ttt gec gtt tct aag ttc gag gtg etc ttc ttg cca aca tac gca caa 2487 
Phe Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala Gin 
190 195 200 

get gee aac act cat etc ttg ctt etc aaa gac get cag gtg ttt ggt 2535 
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Ala Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe Gly 
205 210 215 

gag gaa tgg ggt tac tec agt gaa gat gtt gec gag ttc tac cgt agg 2583 
Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg Arg 



aac gtt ggg etc aat ggt ctt aga gga tct acc tac gac gca tgg gtg 
Asn Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp Val 
250 255 260 2S5 

aag ttc aac agg ttt cgt aga gag atg acc ttg act gtg etc gat ctt 
Lys Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp Leu 



ate gtt etc ttt cca ttc tac gac att cgt ctt tac tec aaa ggc gtt 
He Val Leu Phe Pro Phe Tyr Asp He Arg Leu Tyr Ser Lys Gly Val 



aag aca gag ctg acc aga gac ate ttc acc gat ccc ate ttc eta ctt 
Lys Thr Glu Leu Thr Arg Asp He Phe Thr Asp Pro He Phe Leu Leu 



acg acc ctg cag aaa tac ggt cca act ttt etc tec att gag aac age 

Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser He Glu Asn Ser 
315 320 325 

ate agg aag cct cac etc ttc gac tat ctg caa ggc att gag ttt cac 

He Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly He Glu Phe His 

330 335 340 345 

acc agg ttg caa cct ggt tac ttc ggt aag gat tec ttc aac tac tgg 

Thr Arg Leu Gin Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp 

350 355 360 

age gga aac tac gtt gaa acc aga cca tec ate gga tct age aag acc 

Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly Ser Ser Lys Thr 
365 370 375 

ate act tct cca ttc tac ggt gac aag age act gag cca gtg cag aag 

He Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu Pro Val Gin Lys 
380 385 390 

ttg age ttc gat ggg cag aag gtg tat aga acc ate gec aat acc gat 

Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He Ala Asn Thr Asp 
395 400 405 

gtt gca get tgg cct aat ggc aag gtc tac ctt gga gtt act aaa gtg 

Val Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly Val Thr Lys Val 

410 415 420 425 

gac ttc tec caa tac gac gat cag aag aac gag aca tct act caa acc 

Asp Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr 

430 435 440 

tac gat agt aag agg aac aat ggc cat gtt tec gca caa gac tec att 

Tyr Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala Gin Asp Ser He 



i cct gaa acc act gat gaa cca ttg gag aag get 
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460 465 470 
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agt cac caa ctt aac tac gcc gaa tgc ttt etc atg caa gac agg cgt 3351 
Ser His Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met Gin Asp Arg Arg 
475 480 485 

ggc acc att ccg ttc ttt aca tgg act cac agg tct gtc gac ttc ttt 3399 
Gly Thr He Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe Phe 
490 495 500 505 

aac act ate gac get gag aag att acc caa ctt ccc gtg gtc aag get 3447 
Asn Thr He Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys Ala 
510 515 520 

tat gcc ttg tec age gga get tec ate att gaa ggt cca ggc ttc acc 3495 
Tyr Ala Leu Ser Ser Gly Ala Ser He He Glu Gly Pro Gly Phe Thr 
525 530 535 

ggt ggc aac ttg etc ttc ctt aag gag tec age aac tec ate gcc aag 3543 
Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn Ser He Ala Lys 
540 545 550 

ttc aaa gtg aca ctt aac tea gca gcc ttg etc caa cgt tac agg gtt 3 591 
Phe Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val 
555 560 5S5 

cgt ate aga tac gca age act acc aat ctt cgc etc ttt gtc cag aac 3639 
Arg He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin Asn 
570 575 580 585 

age aac aat gat ttc ctt gtc ate tac ate aac aag act atg aac aaa 3687 
Ser Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn Lys 
590 595 600 

gac gat gac etc acc tac caa aca ttc gat ctt gcc act acc aat agt 3 73 5 
Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser 
605 610 615 

aac atg gga ttc tct ggt gac aag aac gag ctg ate ata ggt get gag 3 783 
Asn Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala Glu 
620 625 630 

age ttt gtc tct aat gag aag att tac ata gac aag ate gag ttc att 3831 
Ser Phe Val Ser Asn Glu Lys He Tyr He Asp Lys He Glu Phe He 
635 640 645 

cca gtt caa etc taatagatcc cccgggctgc aggaattccc gatcgttcaa 3 883 

Pro Val Gin Leu 

650 

acatttggca ataaagtttc ttaagattga atcctgttgc eggtcttgeg atgattatca 3943 

tataatttct gttgaattac gttaagcatg taataattaa catgtaatgc atgacgttat 4003 

ttatgagatg ggtttttatg attagagtcc cgcaattata catttaatac gcgatagaaa 4063 

acaaaatata gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct atgttactag 4123 

ateggggata tccccggggc ggcege 4149 



<210> 14 
<211> 653 
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<212> PRT 

<213> Artificial Sequence 
<400> 14 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 



Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu 



Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 



Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 

260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
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325 



lie Glu Asn Ser He Arg Lys Pro His Leu Phe 
330 335 
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Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 
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<21Q> 15 
<211> 3754 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 
cassette 

<220> 

<221> promoter 
<222> (25) . . (S40) 
<223> P-C3MV.35S 

<220> 

<221> intron 

<222> (669) . . (1472) 

<223> I-Zm.Hsp70 

<220> 
<221> CDS 

<222> (1490) . . (3448) 

<223> Cry3Bbl variant V11231 

<220> 

<221> terminator 
<222> (3475) . . (3730) 

<223> Agrobacterium tumefaciens nos 3' transcription 
termination and polyadenylation sequence 

<400> 15 

gcggccgcgt taacaagctt ctgcaggtcc gatgtgagac ttttcaacaa agggtaatat 60 
ccggaaacct cctcggattc cattgcccag ctatctgtca ctttattgtg aagatagtgg 120 
aaaaggaagg tggctcctac aaatgccatc attgcgataa aggaaaggcc atcgttgaag 180 
atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc atcgtggaaa 24 0 
aagaagacgt tccaaccacg tcttcaaagc aagtggattg atgtgatggt ccgatgtgag 300 
acttttcaac aaagggtaat atccggaaac ctcctcggat tccattgccc agctatctgt 360 
cactttattg tgaagatagt ggaaaaggaa ggtggctcct acaaatgcca tcattgcgat 420 
aaaggaaagg ccatcgttga agatgcctct gccgacagtg gtcccaaaga tggaccccca 480 
cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat 54 0 
tgatgtgata tctccactga cgtaagggat gacgcacaat cccactatcc ttcgcaagac 600 
ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaca agctgactct 660 
agcagatcta ccgtcttcgg tacgcgctca ctccgccctc tgcctttgtt actgccacgt 720 
ttctctgaat gctctcttgt gtggtgattg ctgagagtgg tttagctgga tctagaatta 780 
cactctgaaa tcgtgttctg cctgtgctga ttacttgccg tcctttgtag cagcaaaata 840 
tagggacatg gtagtacgaa acgaagatag aacctacaca gcaatacgag aaatgtgtaa 900 
tttggtgctt agcggtattt atttaagcac atgttggtgt tatagggcac ttggattcag 960 
aagtttgctg ttaatttagg cacaggcttc atactacatg ggtcaatagt atagggattc 1020 
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atattatagg cgatactata ataatttgtt cgtctgcaga gcttattatt tgccaaaatt 10 80 
agatattcct attctgtttt tgtttgtgtg ctgttaaatt gttaacgcct gaaggaataa 1140 
atataaatga cgaaattttg atgtttatct ctgctccttt attgtgacca taagtcaaga 1200 
tcagatgcac ttgttttaaa tattgttgtc tgaagaaata agtactgaca gtattttgat 1260 
gcattgatct gcttgtttgt tgtaacaaaa tttaaaaata aagagtttcc tttttgttgc 1320 
tctccttacc tcctgatggt atctagtatc taccaactga cactatattg cttctcttta 1380 
catacgtatc ttgctcgatg ccttctccct agtgttgacc agtgttactc acatagtctt 1440 
tgctcatttc attgtaatgc agataccaag cggcctctag aggatct 



cct aac aat cgt tec gaa cac gac acc ate aag gtt act cca aac tct 
Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr Pro Asn Ser 



gag ttg caa act aat cac aac cag tac cca ttg get gac aat cct aac 
Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp Asn Pro Asn 



agt act ctt gag gaa ctt aac tac aag gag ttt etc egg atg acc gaa 
Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met Thr Glu 



gga act ggc att age gtt gtg gga cag att ctt gga gtg gtt ggt gtt 
Gly Thr Gly lie Ser Val Val Gly Gin He Leu Gly Val Val Gly Val 



cca ttc get gga get ttg acc age ttc tac cag tec ttt etc aac acc 
Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu Asn Thr 



ate tgg cct tea gat get gat ccc tgg aag get ttc atg gec caa gtg 
He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala Gin Val 
100 105 110 115 

gaa gtc ttg ate gat aag aag ate gaa gag tat gec aag tct aaa gee 
Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser Lys Ala 



ttg get gag ttg caa ggt ttg cag aac aac ttc gag gat tac gtc aac 
Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr Val Asn 
135 140 145 

gca etc aac age tgg aag aaa act ccc ttg agt etc agg tct aag cgt 
Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser Lys Arg 



tec cag gac cgt att cgt gaa ctt ttc age caa gee gaa tec cac ttc 
Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu Ser His Phe 
165 170 175 



: age ttt gee gtt tct aag ttc gag gtg ( 
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Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val Leu Phe 
130 185 190 195 

ttg cca aca tac gca caa get gec aac act cat etc ttg ctt etc aaa 2122 
Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu Leu Leu Lys 
200 205 210 

gac get cag gtg ttt ggt gag gaa tgg ggt tac tec agt gaa gat gtt 2170 
Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val 
215 220 225 

gec gag ttc tac cgt agg cag etc aag ttg act caa cag tac aca gac 2218 
Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin Tyr Thr Asp 
230 235 240 

cac tgc gtc aac tgg tac aac gtt ggg etc aat ggt ctt aga gga tct 2266 
His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg Gly Ser 
245 250 255 

ace tac gac gca tgg gtg aag ttc aac agg ttt cgt aga gag atg acc 2314 
Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu Met Thr 
260 265 270 275 

ttg act gtg etc gat ctt ate gtt etc ttt cca ttc tac gac att cgt 2362 
Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr Asp He Arg 
280 285 290 

ctt tac tec aaa ggc gtt aag aca gag ctg acc aga gac ate ttc acc 2410 
Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp He Phe Thr 
295 300 305 

gat ccc ate ttc eta ctt acg acc ctg cag aaa tac ggt cca act ttt 2458 
Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe 
310 315 320 

etc tee att gag aac age ate agg aag cct cac etc ttc gac tat ctg 2506 
Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe Asp Tyr Leu 
325 330 335 

eaa ggc att gag ttt cac acc agg ttg caa cct ggt tac ttc ggt aag 2554 
Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr Phe Gly Lys 
340 345 350 355 

gat tec ttc aac tac tgg age gga aac tac gtt gaa acc aga cca tec 2602 
Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser 
360 365 370 

ate gga tct age aag acc ate act tct cca ttc tac ggt gac aag age 2650 
He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp Lys Ser 
375 380 385 

act gag cca gtg cag aag ttg age ttc gat ggg cag aag gtg tat aga 26 98 
Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg 
390 395 400 

acc ate gec aat acc gat gtt gca get tgg cct aat ggc aag gtc tac 2746 
Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys Val Tyr 
405 410 415 

ctt gga gtt act aaa gtg gac ttc tec caa tac gac gat cag aag aac 2794 
Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp Gin Lys Asn 
420 425 430 435 

gag aca tct act caa acc tac gat agt aag agg aac aat ggc cat gtt 2842 



39 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 

Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn Gly His Val 
440 445 450 



PCT/US99/18883 



tec gca caa gac tec att gac caa ctt cca cct gaa acc act gat gaa 2890 
Ser Ala Gin Asp Ser lie Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu 
455 460 4S5 

cca ttg gag aag get tac agt cac caa ctt aac tac gec gaa tgc ttt 2938 
Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu Cys Phe 
470 475 480 

cte atg caa gac agg cgt ggc acc att ccg ttc ttt aca tgg act cac 2986 
Leu Met Gin Asp Arg Arg Gly Thr lie Pro Phe Phe Thr Trp Thr His 
485 490 495 

agg tct gtc gac ttc ttt aac act ate gac get gag aag att acc caa 3 034 
Arg Ser Val Asp Phe Phe Asn Thr lie Asp Ala Glu Lys lie Thr Gin 
500 505 510 515 

ctt ccc gtg gtc aag get tat gec ttg tec age gga get tec ate att 3082 
Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser He He 
520 525 530 

gaa ggt cca ggc ttc acc ggt ggc aac ttg etc ttc ctt aag gag tec 3130 
Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser 
535 540 545 

age aac tec ate gee aag ttc aaa gtg aca ctt aac tea gca gee ttg 3178 
Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala Ala Leu 
550 555 560 

etc caa cgt tac agg gtt cgt ate aga tac gca age act acc aat ctt 3226 
Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asn Leu 
565 570 575 

cgc etc ttt gtc cag aac age aac aat gat ttc ctt gtc ate tac ate 3274 
Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val He Tyr He 
580 585 590 595 

aac aag act atg aac aaa gac gat gac etc acc tac caa aca ttc gat 3322 
Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp 
600 605 610 

ctt gee act acc aat agt aac atg gga ttc tct ggt gac aag aac gag 3370 
Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Lys Asn Glu 
615 620 625 

ctg ate ata ggt get gag age ttt gtc tct aat gag aag att tac ata 3418 
Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He Tyr He 
630 635 640 

gac aag ate gag ttc att cca gtt caa etc taatagatcc cccgggctgc 3468 
Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 

aggaattccc gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc 3528 

eggtcttgeg atgattatca tataatttct gttgaattac gttaagcatg taataattaa 3588 

catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata 3648 

catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc 3708 

ggtgtcatct atgttactag ateggggata tccccggggc ggcege 3 754 
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<210> 16 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 16 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly lie Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 



Ser Lys Ala Leu Ala Glu Leu Gin 
130 135 

Tyr Val Asn Ala Leu Asn Ser Trp 
145 150 

Ser Lys Arg Ser Gin Asp Arg He 
165 

Ser His Phe Arg Asn Ser Met Pro 



Gly Leu Gin Asn Asn Phe Glu Asp 
140 

Lys Lys Thr Pro Leu Ser Leu Arg 
155 160 

Arg Glu Leu Phe Ser Gin Ala Glu 
170 175 

Ser Phe Ala Val Ser Lys Phe Glu 
185 190 



Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 
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lie Phe Thr Asp Pro lie Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 

Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe 
325 330 335 

Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 
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PCTYUS99/18883 



<210> 17 

<211> 3450 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 



<220> 

<221> promoter 
<222> (14) . . (235) 
<223> P-CaMV.AS4 

<220> 

<221> 5'UTR 

<222> (240) . . (304) 

<223> L-Ta.hcbl 

<220> 

<221> intron 
<222> (318) . . (805) 
<223> I-0s.Actl 

<220> 

<221> transit_peptide 

<222> (825) . . (971) 

<223> amino terminal TS-Zm.rbcS 

<220> 

<221> intron 

<222> (972) . . (1134) 

<223> I-Zm.rbcS 

<220> 

<221> transit_peptide 

<222> (1135) . . (1221) 

<223> carboxy terminus TS-Zm.rbcS 

<220> 
<221> CDS 

<222> (1222) . . (3180) 

<223> Cry3Bbl variant 11231mvl 

<220> 

<221> terminator 
<222> (3198) . . (3431) 
<223> T-Ta.hspl7 

<400> 17 

gcggccgcgt taacaagctt ctgacgtaag ggatgacgca cctgacgtaa gggatgacgc 60 
acctgacgta agggatgacg cacctgacgt aagggatgac gcactcgaga tccccatctc 120 
cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata 180 
aggaagttca tttcatttgg agaggacacg ctgacaagct agcttggctg caggtagatc 240 
ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg gacaacacac 3 00 
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cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc 360 

tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg 420 

agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct cgcggctggg 480 

gctctcgccg gcgtggatcc ggoccggatc tcgcggggaa tggggctctc ggatgtagat 540 

ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc cgtgctaaac 600 

aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt tctgctgctt 660 

cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa tttgaatccc 720 

tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca gcctcgtgcg 780 

gagctttttt gtaggtagaa gtgatcaacc tctagaggat cagcatggcg cccaccgtga 840 

tgatggcctc gtcggccacc gccgtcgctc cgttcctggg gctcaagtcc accgccagcc 900 

tccccgtcgc ccgccgctcc tccagaagcc tcggcaacgt cagcaacggc ggaaggatcc 960 

ggtgcatgca ggtaacaaat gcatcctagc Cagtagttct tcgcattgca gcagotgcag 1020 

ctagcgagtt agtaatagga agggaactga tgatccatgc atggactgat gtgtgttgcc 1080 

catcccatcc catcccattt cccaaacgaa ccgaaaacac cgtactacgt gcaggtgtgg 1140 

ccctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc gaccggcggg 1200 

cgcatccgot gcatgcaggc c atg gcc aac ccc aac aat cgc tec gag cac 1251 
Met Ala Asn Pro Asn Asn Arg Ser Glu His 



gac aac tec acc gtc aag gac gcc gtc ggg acc ggc ate tec gtc gtt 
Asp Asn Ser Thr Val Lys Asp Ala Val Gly Tbr Gly He Ser Val Val 



tec ttc tac cag tec ttc ctg aac acc ate tgg ccc tec gac gcc gac 
Ser Phe Tyr Gin Ser Phe Leu Asn Thr He Trp Pro Ser Asp Ala Asp 
95 100 105 
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caa aac aac ttc gag gac tac gtc aac gcg ctg aac tec tgg aag aag 1683 
Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys 
140 145 150 

acg cct ctg tec ctg cgc tec aag cgc tec cag ggc cgc ate cgc gag 1731 
Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin Gly Arg lie Arg Glu 



ctg ttc tec cag gec gag tec cac ttc cgc aac tec atg ccg tec ttc 
Leu Phe Ser Gin Ala Glu Ser His Phe Arg Asn Ser Met Pro Ser Phe 



gec gtc tec aag ttc gag gtc ctg ttc ctg ccc acc tac gee cag get 1827 
Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala 



gec aac acc cac etc ctg ttg ctg aag gac gee cag gtc ttc ggc gag 1875 
Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe Gly Glu 
205 210 215 

gaa tgg ggc tac tec teg gag gac gtc gec gag ttc tac cgt cgc cag 1923 
Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin 



ctg aag ctg acc caa cag tac acc gac cac tgc gtc aac tgg tac aac 1971 

Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys Val Asn Trp Tyr Asn 

235 240 245 250 

gtc ggc ctg aac ggc ctg agg ggc tec acc tac gac gca tgg gtc aag 2019 

Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys 

255 260 265 

ttc aac cgc ttc cgc agg gag atg acc ctg acc gtc ctg gac ctg ate 2067 

Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp Leu He 



gtc ctg ttc ccc ttc tac gac ate cgc ctg tac tec aag ggc gtc aag 2115 
Val Leu Phe Pro Phe Tyr Asp He Arg Leu Tyr Ser Lys Gly Val Lys 
285 290 295 

acc gag ctg acc cgc gac ate ttc acg gac ccc ate t 
Thr Glu Leu Thr Arg Asp He Phe Thr Asp Pro He I 
300 305 310 

acc etc cag aag tac ggt ccc acc ttc ctg tec ate gag aac tec ate 2211 
Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser He Glu Asn Ser He 
315 320 325 330 

cgc aag ccc cac ctg ttc gac tac etc cag ggc ate gag ttc cac acg 2259 
Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly He Glu Phe His Thr 
335 340 345 

cgc ctg agg cca ggc tac ttc ggc aag gac tec ttc aac tac tgg tec 23 07 
Arg Leu Arg Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser 



ggc aac tac gtc gag acc agg ccc tec ate ggc tec teg aag acg ate 23 55 
Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly Ser Ser Lys Thr He 



acc tec cct ttc tac ggc gac aag tec acc gag ccc gtc cag aag etg 2403 
Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu Pro Val Gin Lys Leu 
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380 385 390 

tec ttc gac ggc cag aag gtc tac cgc acc ate gec aac acc gac gtc 2451 
Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr lie Ala Asn Thr Asp Val 
395 400 405 410 

gcg get tgg ccg aac ggc aag gtc tac ctg ggc gtc acg aag gtc gac 2499 
Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly Val Thr Lys Val Asp 
415 420 425 

ttc tec cag tac gat gac cag aag aat gaa acc tec acc cag acc tac 2 54 7 
Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr 
430 435 440 

gac tec aag cgc aac aat ggc cac gtc tec gec cag gac tec ate gac 2595 
Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala Gin Asp Ser lie Asp 



I caa gac cgc agg ggc 2 691 
: Gin Asp Arg Arg Gly 
475 480 485 490 

acc ate ccc ttc ttc acc tgg acc cac cgc tec gtc gac ttc ttc aac 2739 
Thr lie Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe Phe Asn 
495 500 505 

acc ate gac gec gag aag ate acc cag ctg ccc gtg gtc aag gec tac 2 78 7 
Thr lie Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys Ala Tyr 
510 515 520 

gee ctg tec teg ggt gec tec ate att gag ggt cca ggc ttc acc ggt 2835 
Ala Leu Ser Ser Gly Ala Ser He He Glu Gly Pro Gly Phe Thr Gly 
525 530 535 

ggc aac ctg ctg ttc ctg aag gag tec teg aac tec ate gec aag ttc 2883 
Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn Ser He Ala Lys Phe 
540 545 550 

aag gtc acc ctg aac tec get gee ttg ctg caa cgc tac cgc gtc cgc 2931 
Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg 
555 560 565 570 

ate cgc tac gee tec acc acg aac ctg cgc ctg ttc gtc cag aac tec 2979 
He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin Asn Ser 
575 580 585 

aac aat gac ttc ctg gtc ate tac ate aac aag acc atg aac aag gac 3 027 
Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn Lys Asp 
590 595 600 

gat gac ctg acc tac cag acc ttc gac etc gec acc acg aac tec aac 3075 
Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn 



atg ggc ttc teg ggc gac aag aat gaa ctg ate att ggt get gag tct 
Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala Glu Se] 
620 625 630 
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635 640 645 650 

3220 

atgctcattc aggttggagc caatttggtt gatgtgtgtg cgagttcttg cgagtctgat 3280 
gagacatctc tgtattgtgt ttctttcccc agtgttttct gtacttgtgt aatcggctaa 3340 
tcgccaacag attcggcgat gaataaatga gaaataaatt gttctgattt tgagtgcaaa 3400 
aaaaaaggaa ttagatctgt gtgtgttttt tggatccccg gggcggccgc 3450 

<210> 18 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 18 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 



Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 



Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
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Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu lie Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp lie Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

lie Phe Thr Asp Pro lie Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 

Pro Thr Phe Leu Ser lie Glu Asn Ser lie Arg Lys Pro His Leu Phe 
325 330 335 

Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 570 575 
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Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

lie Tyr lie Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu lie lie Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

lie Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 19 
<211> 3039 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 
<220> 

<221> promoter 
<222> (14) . . (235) 
<223> P-CaMV.AS4 

<220> 

<221> 5'UTR 

<222> (240) . . (304) 

<223> L-Ta.hcbl 

<220> 

<221> intron 
<222> (318) . . (805) 
<223> I-Os.Actl 

<220> 

<221> CDS 

<222> (811) . . (2769) 

<223> Cry3Bbl variant 1123 lmvl 

<220> 

<221> terminator 

<222> (2787) . . (3020) 

<223> T-Ta.hspl7 

<400> 19 

gcggccgcgt taacaagctt ctgacgtaag ggatgacgca cctgacgtaa gggatgacgc 6 0 
acctgacgta agggatgacg cacctgacgt aagggatgac gcactcgaga tccccatctc 12 0 
cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata 180 
aggaagttca tttcatttgg agaggacacg ctgacaagct agcttggctg caggtagatc 240 
ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg gacaacacac 300 
cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc 360 
tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg 420 
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agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct cgcggctggg 4 80 

gctctcgccg gcgtggatcc ggcccggatc tcgcggggaa tggggctctc ggatgtagat 540 

ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc cgtgctaaac 6 00 

aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt tctgctgctt 660 

cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa tttgaatccc 720 

tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca gcctcgtgcg 780 

gagctttttt gtaggtagaa gtgatcaacc atg gcc aac ccc aac aat cgc tec 834 
Met Ala Asn Pro Asn Asn Arg Ser 

1 5 

gag cac gac acg ate aag gtc acc ccc aac tec gag etc cag acc aac 8 82 
Glu His Asp Thr lie Lys Val Thr Pro Asn Ser Glu Leu Gin Thr Asn 
10 15 • 20 

cac aac cag tac ccg ctg gcc gac aac ccc aac tec acc ctg gaa gag 930 
His Asn Gin Tyr Pro Leu Ala Asp Asn Pro Asn Ser Thr Leu Glu Glu 
25 30 35 40 

ctg aac tac aag gag ttc ctg cgc atg acc gag gac tec tec acg gag 978 
Leu Asn Tyr Lys Glu Phe Leu Arg Met Thr Glu Asp Ser Ser Thr Glu 
45 50 55 

gtc ctg gac aac tee acc gtc aag gac gcc gtc ggg acc ggc ate tec 1026 
Val Leu Asp Asn Ser Thr Val Lys Asp Ala Val Gly Thr Gly He Ser 
60 65 70 

gtc gtt ggg cag ate ctg ggc gtc gtt ggc gtc ccc ttc gca ggt get 1074 
Val Val Gly Gin He Leu Gly Val Val Gly Val Pro Phe Ala Gly Ala 
75 80 85 

etc acc tec ttc tac cag tec ttc ctg aac acc ate tgg ecc tec gac 1122 
Leu Thr Ser Phe Tyr Gin Ser Phe Leu Asn Thr He Trp Pro Ser Asp 
90 95 100 

gcc gac ccc tgg aag gcc ttc atg gcc caa gtc gaa gtc ctg ate gac 1170 
Ala Asp Pro Trp Lys Ala Phe Met Ala Gin Val Glu Val Leu He Asp 
105 110 115 120 

aag aag ate gag gag tac gcc aag tec aag gcc ctg gcc gag ctg caa 1218 
Lys Lys He Glu Glu Tyr Ala Lys Ser Lys Ala Leu Ala Glu Leu Gin 
125 130 135 

ggc ctg caa aac aac ttc gag gac tac gtc aac gcg ctg aac tec tgg 1266 
Gly Leu Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu Asn Ser Trp 
140 145 150 

aag aag acg cct ctg tec ctg cgc tec aag cgc tec cag ggc cgc ate 1314 
Lys Lys Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin Gly Arg He 
155 160 165 

cgc gag ctg ttc tec cag gcc gag tec cac ttc cgc aac tec atg ccg 1362 
Arg Glu Leu Phe Ser Gin Ala Glu Ser His Phe Arg Asn Ser Met Pro 
170 175 180 

tec ttc gcc gtc tec aag ttc gag gtc ctg ttc ctg ceo ace tac gcc 1410 
Ser Phe Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala 
185 190 195 200 
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cag get gec aac acc cac etc ctg ttg ctg aag gac gec cag gtc ttc 1458 
Gin Ala Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe 
205 210 215 

ggc gag gaa tgg ggc tac tec teg gag gac gtc gee gag ttc tac cgt 1506 
Gly Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg 
220 225 230 

cgc cag ctg aag ctg acc caa cag tac acc gac cac tgc gtc aac tgg 1554 
Arg Gin Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys Val Asn Trp 
235 240 245 

tac aac gtc ggc ctg aac ggc ctg agg ggc tec acc tac gac gca tgg 1602 
Tyr Asn Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp 
250 255 260 

gtc aag ttc aac cgc ttc cgc agg gag atg acc ctg acc gtc ctg gac 1650 
Val Lys Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp 
265 270 275 280 

ctg ate gtc ctg ttc ccc ttc tac gac ate cgc ctg tac tec aag ggc 1698 
Leu lie Val Leu Phe Pro Phe Tyr Asp He Arg Leu Tyr Ser Lys Gly 
285 290 295 

gtc aag acc gag ctg acc cgc gac ate ttc acg gac ccc ate ttc ctg 1746 
Val Lys Thr Glu Leu Thr Arg Asp He Phe Thr Asp Pre He Phe Leu 
300 305 310 

etc acg acc etc cag aag tac ggt ccc acc ttc ctg tec ate gag aac 1794 
Leu Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser He Glu Asn 
315 320 325 

tec ate cgc aag cec cac ctg ttc gac tac etc cag ggc ate gag ttc 1842 
Ser He Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly He Glu Phe 
330 335 340 

cac acg cgc ctg agg cca ggc tac ttc ggc aag gac tec ttc aac tac 1890 
His Thr Arg Leu Arg Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr 
345 350 355 360 

tgg tec ggc aac tac gtc gag acc agg ccc tec ate ggc tec teg aag 1938 
Trp Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly Ser Ser Lys 
365 370 375 



aag ctg tec ttc gac ggc cag aag gtc tac cgc acc ate gec aac acc 
Lys Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He Ala Asn Thr 
395 400 405 



gtc gac ttc tec cag tac gat gac cag aag aat gaa acc tec acc cag 
Val Asp Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin 
425 430 435 440 

acc tac gac tec aag cgc aac aat ggc cac gtc tec gec cag gac tec 
Thr Tyr Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala Gin Asp Ser 
445 450 455 
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ate gac cag ctg ccg cct gag acc act gac gag ccc ctg gag aag gec 2226 
He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu Lys Ala 
460 465 470 

tac tec cac cag ctg aac tac gcg gag tgc ttc ctg atg caa gac cgc 2274 
Tyr Ser His Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met Gin Asp Arg 
475 480 485 

agg ggc acc ate ccc ttc ttc acc tgg acc cac cgc tec gtc gac ttc 2322 
Arg Gly Thr He Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe 
490 495 500 

ttc aac acc ate gac gec gag aag ate acc cag ctg ccc gtg gtc aag 2370 
Phe Asn Thr He Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys 
505 510 515 520 

gec tac gec ctg tec teg ggt gee tec ate att gag ggt cca ggc ttc 2418 
Ala Tyr Ala Leu Ser Ser Gly Ala Ser He He Glu Gly Pro Gly Phe 
525 530 535 

acc ggt ggc aac ctg ctg ttc ctg aag gag tec teg aac tec ate gee 2466 
Thr Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn Ser He Ala 
540 545 550 

aag ttc aag gtc acc ctg aac tec get gec ttg ctg caa cgc tac cgc 2514 
Lys Phe Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg 
555 560 565 

gtc cgc ate cgc tac gec tec acc acg aac ctg cgc ctg ttc gtc cag 2562 
Val Arg He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin 
570 575 580 

aac tec aac aat gac ttc ctg gtc ate tac ate aac aag acc atg aac 2610 
Asn Ser Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn 
585 590 595 600 

aag gac gat gac ctg acc tac cag acc ttc gac etc gec acc acg aac 2658 
Lys Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn 
605 610 615 

tec aac atg ggc ttc teg gge gac aag aat gaa ctg ate att ggt get 2706 
Ser Asn Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala 
620 625 630 

gag tec ttc gtc tec aat gaa aag ate tac ate gac aag ate gag ttc 2754 
Glu Ser Phe Val Ser Asn Glu Lys He Tyr He Asp Lys He Glu Phe 
635 640 645 

ate ccc gtc cag ctg tgataggaac tctgattgaa ttctgcatgc gtttggacgt 2809 
He Pro Val Gin Leu 
650 

atgetcatte aggttggage caatttggtt gatgtgtgtg cgagttcttg cgagtctgat 2869 

gagacatctc tgtattgtgt ttctttcccc agtgttttct gtacttgtgt aatcggctaa 2929 

tcgccaacag atteggegat gaataaatga gaaataaatt gttctgattt tgagtgcaaa 2989 

aaaaaaggaa ttagatctgt gtgtgttttt tggatccccg gggcggccgc 3039 



<210> 20 
<211> 653 
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<400> 20 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
35 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 

Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 
145 150 155 160 

Ser Lys Arg Ser Gin Gly Arg He Arg Glu Leu Phe Ser Gin Ala Glu 
165 170 175 

Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 

Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 
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Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe 
325 330 335 
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Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 " 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



54 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/1 1 185 PCT7US99/1 8883 

<210> 21 
<211> 3039 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 
<220> 

<221> promoter 
<222> (14) . . (235) 
<223> P-CaMV.AS4 

<220> 

<221> 5'UTR 

<222> (240) . . (304) 

<223> L-Ta.hcbl 

<220> 

<221> intron 

<222> (318) . . (805) 

<223> I-0s.Actl 

<220> 
<221> CDS 

<222> (811) . . (2769) 

<223> Cry3Bbl variant 11231mv2 

<220> 

<221> terminator 
<222> (2787) . . (3020) 
<223> T-Ta.hspl7 

<400> 21 

gcggccgcgt taacaagctt ctgacgtaag ggatgacgca cctgacgtaa gggatgacgc 60 

acctgacgta agggatgacg cacctgacgt aagggatgac gcactcgaga tccccatctc 120 

cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata 180 

aggaagttca tttcatttgg agaggacacg ctgacaagct agcttggctg caggtagatc 24 0 

ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg gacaacacac 300 

cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc 360 

tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg 420 

agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct cgcggctggg 480 

gctctcgccg gcgtggatcc ggcccggatc tcgcggggaa tggggctctc ggatgtagat 540 

ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc cgtgctaaac 600 

aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt tctgctgctt 66 0 

cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa tttgaatccc 720 

tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca gcctcgtgcg 780 

gagctttttt gtaggtagaa gtgatcaacc atg gcc aac ccc aac aat cgc tec 834 
Met Ala Asn Pro Asn Asn Arg Ser 



55 



5/14/2009, EAST Version: 2.3.0.3 



PCMJS99/18883 



cac aac cag tac ccg ctg gcc gac aac ccc aac tec acc ctg gaa gag 
His Asn Gin Tyr Pro Leu Ala Asp Asn Pro Asn Ser Thr Leu Glu Glu 



gtc gtt ggg cag ate ctg ggc gtc gtt ggc gtc ccc ttc gca ggt get 
Val Val Gly Gin He Leu Gly Val Val Gly Val Pro Phe Ala Gly Ala 



gcc gac ccc tgg aag gcc ttc atg gcc caa gtc gaa gtc ctg ate gac 
Ala Asp Pro Trp Lys Ala Phe Met Ala Gin Val Glu Val Leu He Asp 
105 110 115 120 

aag aag ate gag gag tac gcc aag tec aag gcc ctg gcc gag ctg caa 
Lys Lys lie Glu Glu Tyr Ala Lys Ser Lys Ala Leu Ala Glu Leu Gin 



ggc ctg caa aac aac ttc gag gac tac gtc aac gcg ctg aac tec tgg 
Gly Leu Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu Asn Ser Trp 
140 145 150 



cgc gag ctg ttc tec cag gee gag tec cac ttc cgc aac tec atg ccg 
Arg Glu Leu Phe Ser Gin Ala Glu Ser His Phe Arg Asn Ser Met Pro 



tec ttc gcc gtc tec aag ttc gag gtc ctg ttc ctg ccc acc tac gcc 
Ser Phe Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala 



cag get gcc aac acc cac etc ctg ttg ctg aag gac gcc cag gtc ttc 
Gin Ala Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe 
205 210 215 

ggc gag gaa tgg ggc tac tec teg gag gac gtc gcc gag ttc tac cgt 
Gly Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg 
220 225 230 

cgc cag ctg aag ctg ace caa cag tac acc gac cac tgc gtc aac tgg 
Arg Gin Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys Val Asn Trp 
235 240 245 

tac aac gtc ggc ctg aac ggc ctg agg ggc tec acc tac gac gca tgg 
Tyr Asn Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp 
250 255 260 
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gtc aag ttc aac cgc ttc cgc agg gag atg acc ctg acc gtc ctg gac 1650 
Val Lys Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp 
265 270 275 280 

ctg ate gtc ctg ttc ccc ttc tac gac ate cgc ctg tac tec aag ggc 1698 
Leu lie Val Leu Phe Pro Phe Tyr Asp lie Arg Leu Tyr Ser Lys Gly 
285 290 295 

gtc aag acc gag ctg acc cgc gac ate ttc acg gac ccc ate ttc ctg 1746 
Val Lys Thr Glu Leu Thr Arg Asp He Phe Thr Asp Pro He Phe Leu 
300 305 310 

etc acg acc etc cag aag tac ggt ccc acc ttc ctg tec ate gag aac 1794 
Leu Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser He Glu Asn 
315 320 325 

tec ate cgc aag ccc cac ctg ttc gac tac etc cag ggc ate gag ttc 1842 
Ser He Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly He Glu Phe 
330 335 340 

cac acg cgc ctg agg cca ggc tac ttc ggc aag gac tec ttc aac tac 1890 
His Thr Arg Leu Arg Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr 
345 350 355 360 

tgg tec ggc aac tac gtc gag acc agg ccc tec ate ggc tec teg aag 1938 
Trp Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly Ser Ser Lys 
365 370 375 

acg ate acc tec cct ttc tac ggc gac aag tec acc gag ccc gtc cag 1986 
Thr He Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu Pro Val Gin 
380 385 390 

aag ctg tec ttc gac ggc cag aag gtc tac cgc acc ate gee aac acc 2034 
Lys Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He Ala Asn Thr 



gac gtc gcg get tgg ccg aac ggc aag gtc tac ctg ggc gtc acg aag 

Asp Val Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly Val Thr Lys 
410 415 420 

gtc gac ttc tec cag tac gat gac cag aag aat gaa acc tec acc cag 

Val Asp Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin 

425 430 435 440 

acc tac gac tec aag cgc aac aat ggc cac gtc tec gee cag gac tec 

Thr Tyr Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala Gin Asp Ser 
445 450 455 

ate gac cag ctg ccg cct gag acc act gac gag ccc ctg gag aag gee 

He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu Lys Ala 
460 465 470 

tac tec cac cag ctg aac tac gcg gag tgc ttc ctg atg caa gac cgc 

Tyr Ser His Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met Gin Asp Arg 



agg ggc acc ate ccc ttc ttc acc tgg acc cac cgc tec gtc gac ttc 
Arg Gly Thr He Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe 



ttc aac acc ate gac gec gag aag ate acc cag ctg ccc gtg gtc aag 
Phe Asn Thr He Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys 
505 510 515 520 
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gcc tac gcc ctg tec teg ggt gec tec ate att gag ggt cca ggc ttc 2418 
Ala Tyr Ala Leu Ser Ser Gly Ala Ser lie lie Glu Gly Pro Gly Phe 
525 530 535 

ace ggt ggc aac ctg ctg ttc ctg aag gag tec teg aac tec ate gcc 246S 
Thr Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn Ser He Ala 
540 545 550 

aag ttc aag gtc acc ctg aac tec get gcc ttg ctg caa cgc tac cgc 2514 
Lys Phe Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg 
555 560 565 

gtc cgc ate cgc tac gcc tec acc acg aac ctg cgc ctg ttc gtc cag 2562 
Val Arg He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin 
570 575 580 

aac tec aac aat gac ttc ctg gtc ate tac ate aac aag acc atg aac 2610 
Asn Ser Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn 
585 590 595 600 

aag gac gat gac ctg acc tac cag acc ttc gac etc gcc acc acg aac 2658 
Lys Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn 
605 610 615 

tec aac atg ggc ttc teg ggc gac aag aat gaa ctg ate att ggt get 2706 
Ser Asn Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala 
620 625 630 

gag tec ttc gtc tec aat gaa aag ate tac ate gac aag ate gag ttc 2754 
Glu Ser Phe Val Ser Asn Glu Lys He Tyr He Asp Lys He Glu Phe 
635 640 645 

ate ccc gtc cag ctg tgataggaac tctgattgaa ttctgcatgc gtttggacgt 2809 
He Pro Val Gin Leu 
650 



atgetcatte aggttggagc caatttggtt 
gagacatctc tgtattgtgt ttctttcccc 
tcgccaacag atteggegat gaataaatga 
aaaaaaggaa ttagatctgt gtgtgttttt 



gatgtgtgtg cgagttcttg cgagtctgat 2869 
agtgttttct gtacttgtgt aatcggetaa 2929 
gaaataaatt gttctgattt tgagtgcaaa 2989 
tggatccccg gggcggccgc 3 03 9 



<210> 22 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 22 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 

20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 
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Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 

Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 
145 150 155 160 

Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu 
165 170 175 

Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 

Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 

Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe 
325 330 335 

Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
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Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 



Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 



Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 



Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 



Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 



<210> 23 
<211> 3469 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 
<220> 

<221> promoter 
<222> (25) . . (640) 
<223> P-CaMV.35S 

<220> 

<221> 5'UTR 
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<222> (664) . . (734) 
<223> L-Ta.hcbl 



<220> 
<221> CDS 

<222> (1241) . . (3199) 

<223> Cry3Bbl variant 11231mv2 

<220> 

<221> terminator 
<222> (3217) . . (3450) 
<223> T-Ta.hspl7 

<400> 23 

gcggccgcgt taacaagctt ctgcaggtcc gatgtgagac ttttcaacaa agggtaatat 6 0 
ccggaaacct cctcggattc cattgcccag ctatctgtca ctttattgtg aagatagtgg 120 
aaaaggaagg tggctcctac aaatgccatc attgcgataa aggaaaggcc atcgttgaag 180 
atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc atcgtggaaa 240 
aagaagacgt tccaaccacg tcttcaaagc aagtggattg atgtgatggt ccgatgtgag 3 00 
acttttcaac aaagggtaat atccggaaac ctcctcggat tccattgccc agctatctgt 3S0 
cactttattg tgaagatagt ggaaaaggaa ggtggctcct acaaatgcca tcattgcgat 420 
aaaggaaagg ccatcgttga agatgcctct gccgacagtg gtcccaaaga tggaccccca 480 
cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat 540 
tgatgtgata tctccactga cgtaagggat gacgcacaat cccactatcc ttcgcaagac 600 
ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaca agctgactct 6S0 
agcagatcct ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg 720 
gacaacacac cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct 780 
ctcctctttc tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt 840 
tgggtgggcg agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct 900 
cgcggctggg gctctcgccg gcgtggatcc ggcccggatc tcgcggggaa tggggctctc 960 
ggatgtagat ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc 1020 
cgtgctaaac aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt 1080 
tctgctgctt cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa 1140 
tttgaatccc tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca 1200 
gcctcgtgcg gagctttttt gtaggtagaa gtgatce 

aat cgc tec gag cac gac acg ate aag gtc acc c 
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Asn Arg Ser Glu His Asp Thr lie Lys Val Thr Pro Asn Ser Glu Leu 



ctg gaa gag ctg aac tac aag gag ttc ctg cgc atg acc gag gac tec 
Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg Met Thr Glu Asp Ser 



tec acg gag gtc ctg gac aac tec acc gtc aag gac gec gtc ggg acc 
Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys Asp Ala Val Gly Thr 



ggc ate tec gtc gtt ggg cag ate ctg ggc gtc gtt ggc gtc ccc t 
Gly He Ser Val Val Gly Gin He Leu Gly Val Val Gly Val Pro I 



gca ggt get etc acc tec ttc tac cag tec ttc ctg aac acc ate tgg 
Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe Leu Asn Thr He Trp 



ccc tec gae gee gac ccc tgg aag gec ttc atg gee caa gtc gaa gtc 
Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met Ala Gin Val Glu Val 



ctg ate gac aag aag ate gag gag tac gee aag tec aag gec ctg gec 
Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys Ser Lys Ala Leu Ala 
120 125 130 

gag ctg caa ggc ctg caa aac aac ttc gag gac tac gtc aac gcg ctg 
Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu 



aac tec tgg aag aag acg cct ctg tec ctg cgc tec aag cgc tec cag 
Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin 
150 155 160 1G5 



tec atg ccg tec ttc gec gtc tec aag ttc gag gtc ctg ttc ctg ccc 
Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro 
185 190 195 



cag gtc ttc ggc gag gaa tgg ggc tac tec teg gag gac gtc gec gag 

Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu 
215 220 225 

ttc tac cgt cgc cag ctg aag ctg acc caa cag tac acc gac cac tgc 

Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys 

230 235 240 245 

gtc aac tgg tac aac gtc ggc ctg aac ggc ctg agg ggc tec acc tac 
Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr 

250 255 260 

gac gca tgg gtc aag ttc aac cgc ttc cgc agg gag atg acc ctg acc 
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Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr 
265 270 275 

gtc ctg gac ctg ate gtc ctg ttc ccc ttc tac gac ate cgc ctg tac 2119 
Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr Asp He Arg Leu Tyr 
280 285 290 

tec aag ggc gtc aag acc gag ctg acc cgc gac ate ttc acg gac ccc 2167 
Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp He Phe Thr Asp Pro 
295 300 305 

ate ttc ctg etc acg acc etc cag aag tac ggt ccc acc ttc ctg tec 2215 
He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser 
310 315 320 325 

ate gag aac tec ate cgc aag ccc cac ctg ttc gac tac etc cag ggc 2263 
He Glu Asn Ser He Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly 
330 335 340 

ate gag ttc cac acg cgc ctg agg cca ggc tac ttc ggc aag gac tec 2311 
He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr Phe Gly Lys Asp Ser 
345 350 355 

ttc aac tac tgg tec ggc aac tac gtc gag acc agg ccc tec ate ggc 2359 
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly 
360 365 370 

tec teg aag acg ate acc tec cct ttc tac ggc gac aag tec acc gag 2407 
Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu 
375 380 385 

ccc gtc cag aag ctg tec ttc gac ggc cag aag gtc tac cgc acc ate 2455 
Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He 
390 395 400 405 

gee aac acc gac gtc gcg get tgg ccg aac ggc aag gtc tac ctg ggc 2503 
Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly 



tec acc cag ace tac gac tec aag cgc aac aat ggc cac gtc tec gee 2599 

Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala 
440 445 450 

cag gac tec ate gac cag ctg ccg cct gag acc act gac gag ccc ctg 2647 

Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 



gag aag gec tac tec cac cag ctg aac tac gcg gag tgc ttc ctg atg 2695 
Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met 
470 475 480 485 

caa gac cgc agg ggc acc ate ccc ttc ttc acc tgg acc cac cgc tec 2743 
Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr Trp Thr His Arg Ser 
490 495 500 

gtc gac ttc ttc aac acc ate gac gec gag aag ate acc cag ctg ccc 2791 
Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys He Thr Gin Leu Pro 
505 510 515 

gtg gtc aag gee tac gee ctg tec teg ggt gec tec ate att gag ggt 2839 
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Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala Ser He He Glu Gly 
520 525 530 
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cca ggc ttc acc ggt ggc aac ctg ctg ttc ctg aag gag tec teg aac 2 88 7 
Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn 
535 540 545 

tec ate gec aag ttc aag gtc acc ctg aac tec get gec ttg ctg caa 2935 
Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin 
550 555 560 565 

cgc tac cgc gtc cgc ate cgc tac gec tec acc acg aac ctg cgc ctg 2983 
Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu 
570 575 580 

ttc gtc cag aac tec aac aat gac ttc ctg gtc ate tac ate aac aag 3 031 
Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val He Tyr He Asn Lys 
585 590 595 

acc atg aac aag gac gat gac ctg acc tac cag acc ttc gac etc gec 3079 
Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala 
600 605 610 

acc acg aac tec aac atg ggc ttc teg ggc gac aag aat gaa ctg ate 3127 
Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He 
615 620 625 

att ggt get gag tec ttc gtc tec aat gaa aag ate tac ate gac aag 3175 
He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys He Tyr He Asp Lys 
630 635 640 645 

ate gag ttc ate ccc gtc cag ctg tgataggaac tctgattgaa ttctgcatgc 3229 
He Glu Phe He Pro Val Gin Leu 
650 



gtttggacgt atgetcatte aggttggagc 
cgagtctgat gagacatcte tgtattgtgt 
aatcggctaa tcgccaacag atteggegat 
tgagtgcaaa aaaaaaggaa ttagatctgt 



caatttggtt gatgtgtgtg cgagttcttg 3289 
ttctttcccc agtgttttct gtacttgtgt 3349 
gaataaatga gaaataaatt gttctgattt 3409 
gtgtgttttt tggatccccg gggcggccgc 3469 



<210> 24 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 24 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 
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Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr lie Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu lie Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 



Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu 



Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 



Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 



Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Arg Pro Gly Tyr 



Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 



Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
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Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 



<210> 25 
<211> 416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: non-naturally 
occurring nucleotide sequence encoding Zea mays 
ribulose bis-phosphate carboxylase chloroplast 
targeting peptide 

<220> 

<221> CDS 

<222> (16) . . (162) 

<220> 

<221> CDS 

<222> (326) . . (415) 
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<400> 25 

ttctagagga tcagc atg gcg ccc acc gtg atg atg gcc teg teg gec acc 
Met Ala Pro Thr Val Met Met Ala Ser Ser Ala Thr 



ate egg tgc atg cag gtaacaaatg catcctagct agtagttctt tgcattgcag 
lie Arg Cys Met Gin 



cagctgcagc tagegagtta gtaataggaa gggaactgat gatccatgea tggactgatg 2 52 

tgtgttgccc atcccatccc atcccatttc ccaaacgaac cgaaaacacc gtactacgtg 322 

cag gtg tgg ccc tac ggc aac aag aag ttc gag acg ctg teg tac ctg 3 70 
Val Trp Pro Tyr Gly Asn Lys Lys Phe Glu Thr Leu Ser Tyr Leu 



ccg ccg ctg teg acc ggc ggg cgc ate cgc tgc atg cag gcc atg g 
Pro Pro Leu Ser Thr Gly Gly Arg lie Arg Cys Met Gin Ala Met 



<210> 26 
<211> 79 
<212> PRT 

<213> Artificial Sequence 
<400> 26 

Met Ala Pro Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro 
15 10 15 

Phe Leu Gly Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser 
20 25 30 

Ser Arg Ser Leu Gly Asn Val Ser Asn Gly Gly Arg lie Arg Cys Met 
35 40 45 

Gin Val Trp Pro Tyr Gly Asn Lys Lys Phe Glu Thr Leu Ser Tyr Leu 
50 55 60 

Pro Pro Leu Ser Thr Gly Gly Arg He Arg Cys Met Gin Ala Met 
65 70 75 



<210> 27 
<211> 49 
<212> PRT 

<213> Artificial Sequence 
<400> 27 
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Met Ala Pro Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro 
15 10 15 
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Phe Leu Gly Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser 
20 25 30 

Ser Arg Ser Leu Gly Asn Val Ser Asn Gly Gly Arg lie Arg Cys Met 
35 40 45 

Gin 



<210> 28 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<400> 28 

Val Trp Pro Tyr Gly Asn Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro 
15 10 15 

Pro Leu Ser Thr Gly Gly Arg He Arg Cys Met Gin Ala Met 
20 25 30 



<210> 29 
<211> 202 
<212> DNA 

<213> Cauliflower mosaic virus 
<400> 29 

gacgcacctg acgtaaggga tgacgcacct gacgtaaggg atgacgcacc tgacgtaagg SO 
gatgacgcac tcgagatccc catctccact gacgtaaggg atgacgcaca atcccactat 120 
ccttcgcaag acccttcctc tatataagga agttcatttc atttggagag gacacgctga 180 
caagctagct tggctgcagg ta 202 



<210> 30 

<211> 416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: modified 
cauliflower mosaic virus promoter AS 4 

<400> 30 

ttctagagga tcagcatggc gcccaccgtg atgatggcct cgtcggccac cgccgtcgct 60 
ccgttcctgg ggctcaagtc caccgccagc ctccccgtcg cccgccgctc ctccagaagc 120 
ctcggcaacg tcagcaacgg cggaaggatc cggtgcatgc aggtaacaaa tgcatcctag 180 
ctagtagttc tttgcattgc agcagctgca gctagcgagt tagtaatagg aagggaactg 240 
atgatccatg catggactga tgtgtgttgc ccatcccatc ccatcccatt tcccaaacga 300 
accgaaaaca ccgtactacg tgcaggtgtg gccctacggc aacaagaagt tcgagacgct 360 
gtcgtacctg ccgccgctgt cgaccggcgg gcgcatccgc tgcatgcagg ccatgg 416 
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<210> 31 
<211> 75 
<212> DNA 

<213> Triticum aestivum 
<400> 31 



ctagaaccat 


cttccacaca 


ctcaagccac actattggag aacacacagg gacaacacac 


60 


cataagatcc 


aaggg 








75 


<210> 32 
<211> 804 
<212> DNA 
<213> Oryza sp . 










accgtcttcg 


gtacgcgctc 


actccgccct ctgcctttgt 


tactgccacg 


tttctctgaa 


60 


tgctctcttg 


tgtggtgatt 


gctgagagtg gtttagctgg atctagaatt acactctgaa 


120 


atcgtgttct 


gcctgtgctg 


attacttgcc gtcctttgta gcagcaaaat 


atagggacat 


180 


ggtagtacga 


aacgaagata 


gaacctacac agcaatacga 


gaaatgtgta 


atttggtgct 


240 


tagcggtatt 


tatttaagca 


catgttggtg ttatagggca 


cttggattca 


gaagtttgct 


300 


gttaatttag 


gcacaggctt 


catactacat gggtcaatag 


tatagggatt 


catattatag 


360 


gcgatactat 


aataatttgt 


tcgtctgcag agcttattat 


ttgccaaaat 


tagatattcc 


420 


tattctgttt 


ttgtttgtgt 


gctgttaaat tgttaacgcc 


tgaagqaata 


aatataaatg 


480 


acgaaatttt 


gatgtttatc 


tctgctcctt tattgtgacc 


ataagtcaag 


atcagatgca 


540 


cttgttttaa 


atattgttgt 


ctgaagaaat aagtactgac 


agtattttga tgcattgatc 


600 


tgcttgtttg 


Ctgtaacaaa 


atttaaaaat aaagagtttc 


ctttttgttg 


ctctccttac 


660 


ctcctgatgg 


tatctagtat 


ctaccaactg acactatatt 


gcttctcttt 


acatacgtat 


720 


cttgctcgat 




tagtgttgac cagtgttact 


cacatagtct 


ttgctcattt 




cattgtaatg 


cagataccaa 


gcgg 






804 



<210> 33 
<211> 804 
<212> DNA 
<213> Zea mays 

<400> 33 

accgtcttcg gtacgcgctc 
tgctctcttg tgtggtgatt 
atcgtgttct gcctgtgctg 
ggtagtacga aacgaagata 
tagcggtatt tatttaagca 
gttaatttag gcacaggctt 
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gctgagagtg gtttagctgg atctagaatt acactctgaa 120 
attacttgcc gtcctttgta gcagcaaaat atagggacat 180 
gaacctacac agcaatacga gaaatgtgta atttggtgct 240 
catgttggtg ttatagggca cttggattca gaagtttgct 3 00 
catactacat gggtcaatag tatagggatt catattatag 360 
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gcgatactat aataatttgt tcgtctgcag agcttattat ttgccaaaat tagatattcc 420 
tattctgttt ttgtttgtgt gctgttaaat tgttaacgcc tgaaggaata aatataaatg 480 
acgaaatttt gatgtttatc tctgctcctt tattgtgacc ataagtcaag atcagatgca 540 
cttgttttaa atattgttgt ctgaagaaat aagtactgac agtattttga tgcattgatc 600 
tgcttgtttg ttgtaacaaa atttaaaaat aaagagtttc ctttttgttg ctctccttac 660 
ctcctgatgg tatctagtat ctaccaactg acactatatt gcttctcttt acatacgtat 720 
cttgctcgat gccttctccc tagtgttgac cagtgttact cacatagtct ttgctcattt 780 
cattgtaatg cagataccaa gcgg 804 

<210> 34 
<211> 257 
<212> DNA 

<213> Agrobacterium tumefaciens 
<400> 34 

tcccgatcgt tcaaacattt ggcaataaag tttcttaaga Ctgaatcctg ttgccggtct 60 
tgcgatgatt atcatataat ttctgttgaa ttacgttaag catgtaataa ttaacatgta 120 
atgcatgacg ttatttatga gatgggtttt tatgattaga gtcccgcaat tatacattta 180 
atacgcgata gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc 24 0 
atctatgtta ctagatc 257 

<210> 35 
<211> 234 
<212> DNA 

<213> Triticum aestivum 
<400> 35 

aattctgcat gcgtttggac gtatgctcat tcaggttgga gccaatttgg ttgatgtgtg 60 
tgcgagttct tgcgagtctg atgagacatc tctgtattgt gtttctttcc ccagtgtttt 120 
ctgtacttgt gtaatcggct aatcgccaac agattcggcg atgaataaat gagaaataaa 180 
ttgttctgat tttgagtgca aaaaaaaagg aattagatct gtgtgtgttt tttg 234 

<210> 36 
<211> 3455 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 



<221> promoter 
<222> (14) . . (235) 
<223> P.CaMV.AS4 
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<220> 

<221> 5'UTR 

<222> (240) . . (304) 

<223> L-Ta.hcbl 

<220> 

<221> intron 
<222> (318) . . (805) 
<223> I-Os.Actl 

<220> 

<221> transit_peptide 
<222> (825) . . (971) 

<223> TS-Zm.rbcS amino terminal coding sequence upstream 
of Zea mays rbcS intron 

<220> 

<221> intron 

<222> (972) . . (1134) 

<223> I-Zm.rbcS 

<220> 

<221> transit_peptide 
<222> (1135) . . (1221) 

<223> TS-Zm.rbcS carboxy terminus coding sequence 
downstream of Zea mays rbcS intron 

<220> 
<221> CDS 

<222> (1222) . . (3180) 

<223> variant Cry3BBl coding sequence encoding V11231 
<220> 

<221> terminator 
«222> (3198) . . (3431) 
<223> T-Ta.hspl7 

<400> 36 

gcggccgcgt taacaagctt ctgacgtaag ggatgacgca cctgacgtaa gggatgacgc 60 

acctgacgta agggatgacg cacctgacgt aagggatgac gcactcgaga tccccatctc 120 

cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata 180 

aggaagttca tttcatttgg agaggacacg ctgacaagct agcttggctg caggtagatc 240 

ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg gacaacacac 300 

cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc 360 

tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg 42 0 

agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct cgcggctggg 480 

gctctcgccg gcgtggatcc ggcccggatc tcgcggggaa tggggctctc ggatgtagat 540 

ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc cgtgctaaac 600 

aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt tctgctgctt 660 

cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa tttgaatccc 720 

tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca gcctcgtgcg 780 
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gagctttttt gtaggtagaa gtgatcaacc tctagaggat cagcatggcg cccaccgtga 840 



tgatggcctc gtcggccacc gccgtcgctc cgttcctggg gctcaagtcc accgccagcc 900 
tccccgtcgc ccgccgctcc tccagaagcc tcggcaacgt cagcaacggc ggaaggatcc 960 
ggtgcatgca ggtaacaaat gcatcctagc tagtagttct ttgcattgca gcagctgcag 1020 
ctagcgagtt agtaatagga agggaactga tgatccatgc atggactgat gtgtgttgcc 1080 
catcccatcc catcccattt cccaaacgaa ccgaaaacac cgtactacgt gcaggtgtgg 1140 
ccctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc gaccggcggg 1200 
cgcatccgct gcatgcaggc c atg gca aac cct aac aat cgt tec gaa cac 1251 



tac aag gag ttt etc egg atg acc gaa gat age tec act gag gtt etc 
Tyr Lys Glu Phe Leu Arg Met Thr Glu Asp Ser Ser Thr Glu Val Leu 



gat aac tct aca gtg aag gac get gtt gga act ggc att age gtt gtg 
Asp Asn Ser Thr Val Lys Asp Ala Val Gly Thr Gly He Ser Val Val 



age ttc tac cag tec ttt etc aac acc ate tgg cct tea gat get gat 
Ser Phe Tyr Gin Ser Phe Leu Asn Thr He Trp Pro Ser Asp Ala Asp 
95 100 105 

ccc tgg aag get ttc atg gee caa gtg gaa gtc ttg ate gat aag aag 
Pro Trp Lys Ala Phe Met Ala Gin Val Glu Val Leu He Asp Lys Lys 



ate gaa gag tat gec aag tct aaa gec ttg get gag ttg caa ggt ttg 
lie Glu Glu Tyr Ala Lys Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu 



cag aac aac ttc gag gat tac gtc aac gca etc aac age tgg aag aaa 

Gin Asn Asn Phe Glu Asp Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys 
140 145 150 

act ccc ttg agt etc agg tct aag cgt tec cag gac cgt att cgt gaa 

Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin Asp Arg He Arg Glu 

155 160 165 170 



: Gin Ala Glu Ser His Phe Arg Asn Ser Met 



gee gtt tct aag ttc gag gtg etc ttc ttg cca aca tac gca caa get 
Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala 
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gcc aac act cat etc ttg ctt etc aaa gac get cag gtg ttt ggt gag 1875 
Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe Gly Glu 



gaa tgg ggt tac tec agt gaa gat gtt gcc gag ttc tac cgt agg cag 1923 
Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin 



etc aag ttg act caa cag tac aca gac cac tgc gtc aac tgg tac aac 1971 
Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys Val Asn Trp Tyr Asn 
235 240 245 250 

gtt ggg etc aat ggt ctt aga gga tct acc tac gac gca tgg gtg aag 2019 
Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys 
255 260 2S5 

ttc aac agg ttt cgt aga gag atg acc ttg act gtg etc gat ctt ate 2067 
Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp Leu lie 
270 275 280 

gtt etc ttt cca ttc tac gac att cgt ctt tac tec aaa ggc gtt aag 2115 
Val Leu Phe Pro Phe Tyr Asp lie Arg Leu Tyr Ser Lys Gly Val Lys 



aca gag ctg acc aga gac ate ttc acc gat ccc ate ttc eta ctt acg 2163 
Thr Glu Leu Thr Arg Asp lie Phe Thr Asp Pro He Phe Leu Leu Thr 
300 305 310 

acc ctg cag aaa tac ggt cca act ttt etc tec att gag aac age ate 2211 
Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser He Glu Asn Ser He 
315 320 325 330 

agg aag cct cac etc ttc gac tat ctg caa ggc att gag ttt cac acc 2259 
Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly He Glu Phe His Thr 
335 340 345 

agg ttg caa cct ggt tac ttc ggt aag gat tec ttc aac tac tgg age 23 07 
Arg Leu Gin Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser 
350 355 360 

gga aac tac gtt gaa acc aga cca tec ate gga tct age aag acc ate 2355 
Gly Asn Tyr Val Glu Thr Arg Pro Ser He Gly Ser Ser Lys Thr lie 
365 370 375 

act tct cca ttc tac ggt gac aag age act gag cca gtg cag aag ttg 24 03 
Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu Pro Val Gin Lys Leu 
380 385 390 

age ttc gat ggg cag aag gtg tat aga acc ate gcc aat acc gat gtt 2451 
Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He Ala Asn Thr Asp Val 



gca get tgg cct aat ggc aag gtc tac ctt gga gtt act aaa gtg gac 
Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly Val Thr Lys Val Asp 
415 420 425 

ttc tec caa tac gac gat cag aag aac gag aca tct act caa acc tac 
Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr 
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caa ctt cca cct gaa acc act gat gaa cca ttg gag aag get tac agt 2643 
Gin Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser 



: caa ctt aac tac gec gaa tgc ttt etc atg caa gac agg cgt ggc 2691 
; Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met Gin Asp Arg Arg Gly 



acc att ccg ttc ttt aca tgg act cac agg tct gtc gac ttc ttt aac 2739 
Thr lie Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe Phe Asn 
495 500 505 

act ate gac get gag aag att acc caa ctt ccc gtg gtc aag get tat 2787 
Thr He Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys Ala Tyr 
510 515 520 

gec ttg tec age gga get tec ate att gaa ggt cca ggc ttc acc ggt 2835 
Ala Leu Ser Ser Gly Ala Ser He He Glu Gly Pro Gly Phe Thr Gly 
525 530 535 



aaa gtg aca ctt aac tea gca gec ttg etc caa cgt tac agg gtt cgt 2931 
Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg 
555 560 565 570 

ate aga tac gca age act acc aat ctt cgc etc ttt gtc cag aac age 2979 
He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin Asn Ser 
575 580 585 

aac aat gat ttc ctt gtc ate tac ate aac aag act atg aac aaa gac 3 027 
Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn Lys Asp 
590 595 600 

gat gac etc acc tac caa aca ttc gat ctt gec act acc aat agt aac 3075 
Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn 
605 610 615 

atg gga ttc tct ggt gac aag aac gag ctg ate ata ggt get gag age 3123 
Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala Glu Ser 
620 625 630 

ttt gtc tct aat gag aag att tac ata gac aag ate gag ttc att cca 3171 
Phe Val Ser Asn Glu Lys He Tyr He Asp Lys He Glu Phe He Pro 
635 640 645 650 

3220 



gaegtatget cattcaggtt ggagecaatt tggttgatgt gtgtgcgagt tettgegagt 3280 
ctgatgagac atctctgtat tgtgtttctt tccccagtgt tttctgtact tgtgtaatcg 3340 
gctaatcgcc aacagattcg gcgatgaata aatgagaaat aaattgttct gattttgagt 3400 
gcaaaaaaaa aggaattaga tctgtgtgtg ttttttggat ccccggggcg geege 3455 



<210> 37 
<211=> 653 
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<400> 37 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr lie Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 

Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 

Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 
145 150 155 160 

Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu 
165 170 175 

Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 

Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 
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325 330 335 
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Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 
405 410 415 

Lys Val Tyr Leu Gly Val Thr Lys Val Asp Phe Ser Gin Tyr Asp Asp 
420 425 430 

Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 
435 440 445 

Gly His Val Ser Ala Gin Asp Ser He Asp Gin Leu Pro Pro Glu Thr 
450 455 460 

Thr Asp Glu Pro Leu Glu Lys Ala Tyr Ser His Gin Leu Asn Tyr Ala 
465 470 475 480 

Glu Cys Phe Leu Met Gin Asp Arg Arg Gly Thr He Pro Phe Phe Thr 
485 490 495 

Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 
500 505 510 

He Thr Gin Leu Pro Val Val Lys Ala Tyr Ala Leu Ser Ser Gly Ala 
515 520 525 

Ser He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 
530 535 540 

Lys Glu Ser Ser Asn Ser He Ala Lys Phe Lys Val Thr Leu Asn Ser 
545 550 555 560 

Ala Ala Leu Leu Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr 
565 ~ 570 575 

Thr Asn Leu Arg Leu Phe Val Gin Asn Ser Asn Asn Asp Phe Leu Val 
580 585 590 

He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 
595 600 605 

Thr Phe Asp Leu Ala Thr Thr Asn Ser Asn Met Gly Phe Ser Gly Asp 
610 615 620 

Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 
625 630 635 640 

He Tyr He Asp Lys He Glu Phe He Pro Val Gin Leu 
645 650 
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<210> 38 
<211> 3044 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: expression 
cassette 

<220> 

<221> promoter 
<222> (14) . . (235) 
<223> P-CaMV.AS4 

<220> 

<221> 5'UTR 

<222> (240) . . (304) 

<223> L-Ta.hcbl 

<220> 

<221> intron 

<222> (318) . . (805) 

<223> I-0s.Actl 

<220> 
<221> CDS 

<222> (811) . . (2769) 

<223> variant Cry3Bbl coding sequence encoding V11231 
<220> 

<22l> terminator 
<222> (2792) . . (3025) 
<223> T-Ta.hspl7 



gcggccgcgt taacaagctt ctgacgtaag ggatgacgca cctgacgtaa gggatgacgc 60 

acctgacgta agggatgacg cacctgacgt aagggatgac gcactcgaga tccccatctc 120 

cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata 180 

aggaagttca tttcatttgg agaggacacg ctgacaagct agcttggctg caggtagatc 240 

ctagaaccat cttccacaca ctcaagccac actattggag aacacacagg gacaacacac 300 

cataagatcc aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc 360 

tttctccgtt tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg 420 

agaggcggct tcgtgcgcgc ccagatcggt gcgcgggagg ggcgggatct cgcggctggg 4 80 

gctctcgccg gcgtggatcc ggcccggatc tcgcggggaa tggggctctc ggatgtagat 540 

ctgcgatccg ccgttgttgg gggagatgat ggggggttta aaatttccgc cgtgctaaac 600 

aagatcagga agaggggaaa agggcactat ggtttatatt tttatatatt tctgctgctt S60 

cgtcaggctt agatgtgcta gatctttctt tcttcttttt gtgggtagaa tttgaatccc 720 

tcagcattgt tcatcggtag tttttctttt catgatttgt gacaaatgca gcctcgtgcg 780 

gagctttttt gtaggtagaa gtgatcaacc atg gca aac cct aac aat cgt tec 834 
Met Ala Asn Pro Asn Asn Arg Ser 
1 5 



77 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/1 1 1 85 PCT/US99/1 8883 



cac aac cag tac cca ttg get gac aat cct aac agt act ctt gag gaa 
His Asn Gin Tyr Pro Leu Ala Asp Asn Pro Asn Ser Thr Leu Glu Glu 



get gat ccc tgg aag get ttc atg gee caa gtg gaa gtc ttg ate gat 
Ala Asp Pro Trp Lys Ala Phe Met Ala Gin Val Glu Val Leu He Asp 
105 110 115 120 



aag aaa act ccc ttg agt etc agg tct aag cgt tec cag gac cgt att 
Lys Lys Thr Pro Leu Ser Leu Arg Ser Lys Arg Ser Gin Asp Arg He 



cgt gaa ctt ttc age caa gec gaa tec cac ttc aga aac tec atg cct 
Arg Glu Leu Phe Ser Gin Ala Glu Ser His Phe Arg Asn Ser Met Pro 
170 175 180 

age ttt gee gtt tct aag ttc gag gtg etc ttc ttg cca aca tac gca 
Ser Phe Ala Val Ser Lys Phe Glu Val Leu Phe Leu Pro Thr Tyr Ala 



caa get gee aac act cat etc ttg ctt etc aaa gac get cag gtg ttt 
Gin Ala Ala Asn Thr His Leu Leu Leu Leu Lys Asp Ala Gin Val Phe 
205 210 215 

ggt gag gaa tgg ggt tac tec agt gaa gat gtt gec gag ttc tac cgt 
Gly Glu Glu Trp Gly Tyr Ser Ser Glu Asp Val Ala Glu Phe Tyr Arg 
220 225 230 

agg cag etc aag ttg act caa cag tac aca gac cac tgc gtc aac tgg 
Arg Gin Leu Lys Leu Thr Gin Gin Tyr Thr Asp His Cys Val Asn Trp 
235 240 245 

tac aac gtt ggg etc aat ggt ctt aga gga tct ace tac gac gca tgg 
Tyr Asn Val Gly Leu Asn Gly Leu Arg Gly Ser Thr Tyr Asp Ala Trp 
250 255 260 
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gtg aag ttc aac agg ttt cgt aga gag atg acc ttg act gtg etc gat 1650 
Val Lys Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp 
265 270 275 280 

ctt ate gtt etc ttt cca ttc tac gac att cgt ctt tac tec aaa ggc 1698 
Leu lie Val Leu Phe Pro Phe Tyr Asp lie Arg Leu Tyr Ser Lys Gly 
285 290 295 

gtt aag aca gag ctg acc aga gac ate ttc acc gat ccc ate tte eta 174 6 
Val Lys Thr Glu Leu Thr Arg Asp lie Phe Thr Asp Pro lie Phe Leu 
300 305 310 

ctt acg acc ctg cag aaa tac ggt cca act ttt etc tec att gag aac 1794 
Leu Thr Thr Leu Gin Lys Tyr Gly Pro Thr Phe Leu Ser lie Glu Asn 
315 320 325 

age ate agg aag cct cac etc ttc gac tat ctg caa ggc att gag ttt 1842 
Ser lie Arg Lys Pro His Leu Phe Asp Tyr Leu Gin Gly lie Glu Phe 
330 335 340 

cac acc agg ttg caa cct ggt tac ttc ggt aag gat tec ttc aac tac 1890 
His Thr Arg Leu Gin Pro Gly Tyr Phe Gly Lys Asp Ser Phe Asn Tyr 
345 350 355 360 

tgg age gga aac tac gtt gaa acc aga cca tec ate gga tct age aag 1938 
Trp Ser Gly Asn Tyr Val Glu Thr Arg Pro Ser lie Gly Ser Ser Lys 
365 370 375 

acc ate act tct cca ttc tac ggt gac aag age act gag cca gtg cag 1986 
Thr He Thr Ser Pro Phe Tyr Gly Asp Lys Ser Thr Glu Pro Val Gin 
380 385 390 

aag ttg age ttc gat ggg cag aag gtg tat aga acc ate gee aat acc 2034 
Lys Leu Ser Phe Asp Gly Gin Lys Val Tyr Arg Thr He Ala Asn Thr 
395 400 405 

gat gtt gca get tgg cct aat ggc aag gtc tac ctt gga gtt act aaa 2082 
Asp Val Ala Ala Trp Pro Asn Gly Lys Val Tyr Leu Gly Val Thr Lys 
410 415 420 

gtg gac ttc tec caa tac gac gat cag aag aac gag aca tct act caa 2130 
Val Asp Phe Ser Gin Tyr Asp Asp Gin Lys Asn Glu Thr Ser Thr Gin 
425 430 435 440 

acc tac gat agt aag agg aac aat ggc cat gtt tec gca caa gac tec 2178 
Thr Tyr Asp Ser Lys Arg Asn Asn Gly His Val Ser Ala Gin Asp Ser 
445 450 455 

att gac caa ctt cca cct gaa acc act gat gaa cca ttg gag aag get 2226 
He Asp Gin Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu Lys Ala 
460 465 470 

tac agt cac caa ctt aac tac gec gaa tgc ttt etc atg caa gac agg 2274 
Tyr Ser His Gin Leu Asn Tyr Ala Glu Cys Phe Leu Met Gin Asp Arg 
475 480 485 

cgt ggc acc att ccg ttc ttt aca tgg act cac agg tct gtc gac ttc 2322 
Arg Gly Thr He Pro Phe Phe Thr Trp Thr His Arg Ser Val Asp Phe 
490 495 500 

ttt aac act ate gac get gag aag att acc caa ctt cec gtg gtc aag 2370 
Phe Asn Thr He Asp Ala Glu Lys He Thr Gin Leu Pro Val Val Lys 
505 510 515 520 
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get tat gec ttg tec age gga get tec ate att gaa ggt cca ggc ttc 2418 
Ala Tyr Ala Leu Ser Ser Gly Ala Ser lie He Glu Gly Pro Gly Phe 
525 530 535 

ace ggt ggc aac ttg etc ttc ctt aag gag tec age aac tec ate gee 2466 
Thr Gly Gly Asn Leu Leu Phe Leu Lys Glu Ser Ser Asn Ser He Ala 
540 545 550 

aag ttc aaa gtg aca ctt aac tea gca gee ttg etc caa cgt tac agg 2514 
Lys Phe Lys Val Thr Leu Asn Ser Ala Ala Leu Leu Gin Arg Tyr Arg 
555 560 565 

gtt cgt ate aga tac gca age act acc aat ctt cgc etc ttt gtc cag 2562 
Val Arg He Arg Tyr Ala Ser Thr Thr Asn Leu Arg Leu Phe Val Gin 
570 575 580 

aac age aac aat gat ttc ctt gtc ate tac ate aac aag act atg aac 2610 
Asn Ser Asn Asn Asp Phe Leu Val He Tyr He Asn Lys Thr Met Asn 
585 590 595 600 

aaa gac gat gac etc acc tac caa aca ttc gat ctt gee act acc aat 2658 
Lys Asp Asp Asp Leu Thr Tyr Gin Thr Phe Asp Leu Ala Thr Thr Asn 
605 610 615 

agt aac atg gga ttc tct ggt gac aag aac gag ctg ate ata ggt get 2706 
Ser Asn Met Gly Phe Ser Gly Asp Lys Asn Glu Leu He He Gly Ala 
620 625 630 

gag age ttt gtc tct aat gag aag att tac ata gac aag ate gag ttc 2754 
Glu Ser Phe Val Ser Asn Glu Lys He Tyr He Asp Lys He Glu Phe 
635 640 645 

att cca gtt caa etc taatagatcc cccgggctgc aggaattctg catgcgtttg 2809 
He Pro Val Gin Leu 
650 



gaegtatget cattcaggtt ggagecaatt 
ctgatgagac atctctgtat tgtgtttctt 
gctaatcgcc aacagattcg gcgatgaata 
gcaaaaaaaa aggaattaga tctgtgtgtg 



tggttgatgt gtgtgcgagt tettgegagt 2869 
tccccagtgt tttctgtact tgtgtaatcg 2929 
aatgagaaat aaattgttct gattttgagt 2989 
ttttttggat ccccggggcg geege 3 044 



<210> 39 
<211> 653 
<212> PRT 

<213> Artificial Sequence 
<400> 39 

Met Ala Asn Pro Asn Asn Arg Ser Glu His Asp Thr He Lys Val Thr 
15 10 15 

Pro Asn Ser Glu Leu Gin Thr Asn His Asn Gin Tyr Pro Leu Ala Asp 
20 25 30 

Asn Pro Asn Ser Thr Leu Glu Glu Leu Asn Tyr Lys Glu Phe Leu Arg 
35 40 45 

Met Thr Glu Asp Ser Ser Thr Glu Val Leu Asp Asn Ser Thr Val Lys 
50 55 60 
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Asp Ala Val Gly Thr Gly He Ser Val Val Gly Gin He Leu Gly Val 
65 70 75 80 

Val Gly Val Pro Phe Ala Gly Ala Leu Thr Ser Phe Tyr Gin Ser Phe 
85 90 95 

Leu Asn Thr He Trp Pro Ser Asp Ala Asp Pro Trp Lys Ala Phe Met 
100 105 110 

Ala Gin Val Glu Val Leu He Asp Lys Lys He Glu Glu Tyr Ala Lys 
115 120 125 

Ser Lys Ala Leu Ala Glu Leu Gin Gly Leu Gin Asn Asn Phe Glu Asp 
130 135 140 

Tyr Val Asn Ala Leu Asn Ser Trp Lys Lys Thr Pro Leu Ser Leu Arg 
145 150 155 160 

Ser Lys Arg Ser Gin Asp Arg He Arg Glu Leu Phe Ser Gin Ala Glu 
165 170 175 

Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Val Ser Lys Phe Glu 
180 185 190 

Val Leu Phe Leu Pro Thr Tyr Ala Gin Ala Ala Asn Thr His Leu Leu 
195 200 205 

Leu Leu Lys Asp Ala Gin Val Phe Gly Glu Glu Trp Gly Tyr Ser Ser 
210 215 220 

Glu Asp Val Ala Glu Phe Tyr Arg Arg Gin Leu Lys Leu Thr Gin Gin 
225 230 235 240 

Tyr Thr Asp His Cys Val Asn Trp Tyr Asn Val Gly Leu Asn Gly Leu 
245 250 255 

Arg Gly Ser Thr Tyr Asp Ala Trp Val Lys Phe Asn Arg Phe Arg Arg 
260 265 270 

Glu Met Thr Leu Thr Val Leu Asp Leu He Val Leu Phe Pro Phe Tyr 
275 280 285 

Asp He Arg Leu Tyr Ser Lys Gly Val Lys Thr Glu Leu Thr Arg Asp 
290 295 300 

He Phe Thr Asp Pro He Phe Leu Leu Thr Thr Leu Gin Lys Tyr Gly 
305 310 315 320 

Pro Thr Phe Leu Ser He Glu Asn Ser He Arg Lys Pro His Leu Phe 
325 330 335 

Asp Tyr Leu Gin Gly He Glu Phe His Thr Arg Leu Gin Pro Gly Tyr 
340 345 350 

Phe Gly Lys Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Glu Thr 
355 360 365 

Arg Pro Ser He Gly Ser Ser Lys Thr He Thr Ser Pro Phe Tyr Gly 
370 375 380 

Asp Lys Ser Thr Glu Pro Val Gin Lys Leu Ser Phe Asp Gly Gin Lys 
385 390 395 400 

Val Tyr Arg Thr He Ala Asn Thr Asp Val Ala Ala Trp Pro Asn Gly 



81 



5/14/2009, EAST Version: 2.3.0.3 



WO 00/11185 PCT/US99/18883 



Gin Lys Asn Glu Thr Ser Thr Gin Thr Tyr Asp Ser Lys Arg Asn Asn 



Trp Thr His Arg Ser Val Asp Phe Phe Asn Thr He Asp Ala Glu Lys 



• He He Glu Gly Pro Gly Phe Thr Gly Gly Asn Leu Leu Phe Leu 



He Tyr He Asn Lys Thr Met Asn Lys Asp Asp Asp Leu Thr Tyr Gin 



Lys Asn Glu Leu He He Gly Ala Glu Ser Phe Val Ser Asn Glu Lys 



<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: synthetic 
oligonucleotide 



taggcctcca tccatggcaa 



<210> 41 
<211s 42 
<212> DNA 
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<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 

<400> 41 

tcccatcttc ctacttagca ccctgcagaa atacggtcca ac 42 

<210> 42 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 

<400> 42 

gacctcacct accaaacatt cgatcttg 28 

<210> 43 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 

<400> 43 

cgagttctac cgtaggcagc tcaag 25 
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